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Preface 



The need to predict, understand, and optimize complex physical and che- 
mical processes occurring in and around the earth, such as groundwater con- 
tamination, oil reservoir production, discovering new oil reserves, and ocean 
hydrodynamics, has been increasingly recognized. Despite their seemingly 
disparate natures, these geoscience problems have many common mathema- 
tical and computational characteristics. The techniques used to describe and 
study them are applicable across a broad range of areas. 

The study of the above problems through physical experiments, mathe- 
matical theory, and computational techniques requires interdisciplinary colla- 
boration between engineers, mathematicians, computational scientists, and 
other researchers working in industry, government laboratories, and univer- 
sities. By bringing together such researchers, meaningful progress can be 
made in predicting, understanding, and optimizing physical and chemical 
processes. 

The International Workshop on Fluid Flow and Transport in Porous Me- 
dia was successfully held in Beijing, China, August 2-6, 1999. The aim of 
this workshop was to bring together applied mathematicians, computational 
scientists, and engineers working actively in the mathematical and numeri- 
cal treatment of fluid flow and transport in porous media. A broad range 
of researchers presented papers and discussed both problems and current, 
state-of-the-art techniques. 

Over seventy people from Australia, Bulgaria, China, France, Ghana, Ger- 
many, Norway, Russia, South Korea, Spain, Taiwan, and the United States of 
America attended this workshop and more than fifty papers were presented 
on a variety of subjects in mathematical theory, numerical methods, parallel 
computation, optimization, surface water and ocean modeling, chemically re- 
active phenomena, atmospheric modeling, multiscale phenomena, and media 
characterization. 

This book contains thirty-eight selected papers presented at the workshop. 
They cover recent advances and developments of a wide range of numerical 
issues of multiphase fluid flow and transport in porous media. The porous me- 
dia considered range from ordinary media to fractured and deformable ones. 
The physical and mathematical models treated involve a variety of flows from 
single phase compressible flow to multiphase, multicomponent flow with mass 
interchange between phases. The numerical methods studied range from stan- 
dard finite difference and finite element methods to nonstandard mixed finite 
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element and characteristics-based techniques. The computational algorithms 
developed utilize classical fast iterative solvers and modern multigrid and 
domain decomposition approaches combined with local grid refinement tech- 
niques. The characteristics-based techniques for advection-dominated flow 
and transport processes are emphasized in this book; the classical modified 
methods of characteristics to newly developed locally conservative Eulerian- 
Lagrangian methods are addressed. 

Financial support for the workshop was generously provided by the Na- 
tional Natural Science Foundation of China, the Beijing Institute of Applied 
Physics and Computational Mathematics, the Chinese State Key Basic Re- 
search Project, the US Army Research Office-Far Fast, the Office of Naval 
Research International Field Office Asia, and the US Air Force Asian Office 
of Aerospace Research and Development. We also thank the local organizers 
at the International Center of Computational Physics, Beijing, China, for 
their hard work and hospitality. 



Zhangxin Chen, Richard E. Ewing, and Zhong-Ci Shi 
May 29, 2000 
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Abstract 

Mathematical models have been widely used to predict, understand, 
and optimize many complex physical processes, from semiconductor or 
pharmaceutical design to large-scale applications such as global weat- 
her models to astrophysics. In particular, simulation of environmental 
effects of air pollution is extensive. Here we address the need for using 
similar models to understand the fate and transport of groundwater 
contaminants and to design in situ remediation strategies. 

Three basic problem areas need to be addressed in the modeling and 
simulation of the flow of groundwater contamination. First, one obtains 
an effective model to describe the complex fluid/fluid and fluid/rock 
interactions that control the transport of contaminants in groundwater. 
This includes the problem of obtaining accurate reservoir descriptions 
at various length scales and modeling the effects of this heterogeneity 
in the reservoir simulators. Next, one develops accurate discretiza- 
tion techniques that retain the important physical properties of the 
continuous models. Finally, one develops efficient numerical solution 
algorithms that utilize the potential of the emerging computing archi- 
tectures. We will discuss recent advances and describe the contribution 
of each of the papers in this book in these three areas. 



KEYWORDS: reservoir simulation, mathematical models, partial differential 
equations, numerical algorithms 

1 Introduction 

The objective of reservoir simulation is to understand the complex chemical 
and physical fluid flow processes occurring in an underground reservoir suf- 
ficiently well so as to be able to predict the fate and optimize remediation 
of groundwater contaminants. Toward that end, one must be able to pre- 
dict the performance of the reservoir under various remediation schemes. To 
do this, a model of the reservoir and its flow processes must be constructed 
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to yield information about the complex phenomena accompanying different 
remediation strategies. 

There are four major stages to the modeling process. First, a physi- 
cal model of the ffow processes is developed incorporating as much geology, 
chemistry, and physics as is deemed necessary to describe the essential pheno- 
mena. Second, a mathematical formulation of the physical model is obtained, 
usually involving coupled systems of nonlinear, time-dependent partial dif- 
ferential equations. Third, once the properties of the mathematical model, 
such as existence, uniqueness, and regularity of the solution, are sufficiently 
well understood, a discretized numerical model of the mathematical equa- 
tions is produced. A numerical model is determined that has the required 
properties of accuracy and stability and that produces solutions representing 
the basic physical features as well as possible without introducing spurious 
phenomena associated with the specific numerical scheme. Finally, a com- 
puter code capable of efficiently performing the necessary computations for 
the numerical model is developed. The total modeling process encompasses 
aspects of each of these four intermediate steps. 

The modeling process is not completed with one pass through these four 
steps. Once a computer code has been developed which gives concrete quanti- 
tative results for the total model, this output is compared with corresponding 
measured observations of the physical process. If the results do not match 
well, one iterates back through the complete modeling process, changing the 
various intermediate models in ways to obtain a better correlation between 
the physical measurements and the computational results. Often, many ite- 
rations of this modeling loop are necessary to obtain reasonable models for 
the highly complex physical phenomena describing contaminant remediation 
processes. 

The trends in reservoir modeling are contained in three broad topics: 1) 
obtaining better reservoir descriptions and incorporating these descriptions 
in reservoir simulators, 2) modeling the complex multiphase flow processes 
and developing accurate discretization schemes for these models, and 3) de- 
veloping algorithms that can exploit the potential of the emerging computing 
architectures (particularly the potential parallelism of the parallel/vector ar- 
chitectures). We will briefly discuss these major trends in this paper and 
describe the contribution of each of the papers in this book in these areas. 
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2 Reservoir Characterization 

The processes of both single and multiphase flow involve convection, or phy- 
sical transport, of fluids through a heterogeneous reservoir. The equations 
used to simulate this flow at a macroscopic level are variations of Darcy’s law. 
Darcy’s law has been derived via a volume averaging of the Navier-Stokes 
equations m which govern flow through the reservoir at a microscopic or 
pore-volume level. Reservoirs themselves have scales of heterogeneity ran- 
ging from pore-level to held scales. In the standard averaging process for 
Darcy’s law, many important physical phenomena which may eventually go- 
vern the macroscopic flow are lost. The further averaging of the reservoirs 
and fluid properties necessary to use grid blocks of the size of 10-10^ meters 
in field-scale simulators further complicates the modeling process. Certain 
techniques have been developed to address these scaling problems. One of 
them is discussed in Arbogast’s paper in this book on scaling up fine grid 
information to coarse scales in an approximation to a nonlinear two-phase 
flow problem in porous media. 

3 Model Equations for Reservoir Flow 

Although the techniques that we will discuss apply equally well to the reco- 
very of hydrocarbon and the transport of contaminants through the saturated 
or unsaturated soil zones, we will describe the multiphase flow processes in 
the terminology of transport of contaminants in groundwater. 

The simplest and the most popular is the model of fully saturated incom- 
pressible reservoirs. In this case the water (or the liquid) phase occupies the 
whole pore space and the flow is due to the nonuniform pressure distribu- 
tion. The mathematical formulation is based on the mass balance equation 
and Darcy’s law (see, e.g., 171 1^1 

V • (pu) = F and u=— — (Vp — pg) in fl, t > 0, (3.1) 

h 

where C represents a reservoir, u is the volumetric flux of water, F is 
the source or sink of fluid, p is the fluid density, K is the absolute permeability 
tensor, p is the dynamic fluid viscosity, p is the fluid pressure, and g is the 
acceleration vector due to gravity. 

Darcy’s law provides a relation between the volumetric flux in the mass 
conservation equation and the pressure in the fluid. This relation is valid for 
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viscous dominated flows which occur at relatively low velocities. 

The transport of a contaminant that is dissolved in the water is described 
by the following equation: 

+ V • ipvLc) - V • (6»DVc) + pec = Gic) in n, t > 0, (3.2) 

at 

where c is the concentration of the contaminant, D is the dispersion tensor, 
P is the reaction rate, 6 = pp, p is the porosity, and G is the source/sink 
term. The form of the diffusion/dispersion tensor D is given by 



T) = dml+ |u| [diE{n) + dt{I - E{u ))] , 

where Eij{u) = UiUj/\u\^, dm is the molecular diffusion coefficient, and di 
and dt are the longitudinal and transverse dispersion coefficients, respectively. 
In general, di ~ lOd*, but this may vary greatly with different soils, fractured 
reservoir, etc. Also, the viscosity p, in equation (3.1) is assumed to be deter- 
mined by some mixing rule. In addition to equations (3.1) and (3.2), initial 
and boundary conditions are specified. The flow at injection and production 
wells is modeled in equations (3.1) and (3.2) via point or line sources and 
sinks. 

When either an air or vapor phase or a nonaqueous phase liquid contami- 
nant (NAPL) is present, the equations describing two phase, immiscible flow 
in a horizontal reservoir are given by 



— V • IK— V(pu, - pu,g) I = q^p.. 

d(pPaSa) ( PaKa^i A 



(3.3) 



where the subscripts w and a refer to the water and air phases, respectively. 
Si is the saturation, pi is the pressure, pi is the density, k^i is the relative 
permeability, pi is the viscosity, and Qi is the external flow rate, each with 
respect to the ith phase. The saturations sum to unity and thus one of them 
can be eliminated. The pressure between the two phases is described by the 
capillary pressure 

Pc(S^u) = Pa -Pw 



Although formally the system in equations (3.1) and (3.2) seems quite 
different from system (3.3), the latter system may be rearranged in a form 
which very closely resembles the former system. To use the same basic si- 
mulation techniques in our sample computations to treat both miscible and 
immiscible displacement, we will follow the ideas of Chavent \n\ 
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The global pressure p and total velocity v formulation of a two-phase 
water (w) and air (a) flow model is given by the following equations P3: 



dp d4>{p) , ( g \ 

^aCa-y, -f V • V = — h q{x,S^,p), 

at at 

v = -KA(Vp-Ga), 

,dS, 



(3.4) 



(9(/)(p) 



^ — I" ^ ■ {fw'v — Gi — D(S'u,) • V^u,) — —Su, — 0^ + Qw, 



where Gi = KXaQwiPa — Pw)s- The global pressure and total velocity are 
defined by 




and v = v„,-|-Va, 



where Pc{Sc) = 0. Further, d/dt = cj){d/dt) + Va/Sa • V, A = Au, -I- Aa is the 
total mobility, and Xi = kri/pi, i = w, a, is the mobility for water and air. 
The gravity force G\ and capillary diffusion term D{S) are expressed as 

G,= X^P^ + XaPa ^ and B{S^) = -KXJ^^, (3.5) 

and the compressibility Ca and fractional flow of water /u, are defined by 

1 dpa 1 r Xyj 

Ca = — j — and f^ = 

Pa dpa X 

We note that in this formulation the only diffusion/dispersion term is capil- 
lary mixing described by equation (3.5). 

The equations presented above describe multiphase and multicomponent 
flow in reservoirs. They can be used to simulate various production stra- 
tegies in an attempt to understand and optimize hydrocarbon recovery or 
remediation strategies for contaminant removal. However, to use these equa- 
tions effectively, parameters that describe the soil, rock, and fluid properties 
for the particular reservoir application must be input into the model. The 
relative permeabilities, which are nonlinear functions of the water saturation, 
can be estimated via laboratory experiments using reservoir cores and resi- 
dent fluids. In the groundwater literature, often both the specific storativity 
9 — PwSw and the relative permeabilities are estimated using parameter fit- 
ting of certain function forms (see, e.g., van Genuchten j80]). The popularity 
of the van Genuchten fits comes from the fact that they produce smooth, 
differentiable functions that are easy to handle numerically. Similarly, fluid 
viscosities are relatively easy to obtain. However, the permeability tensor 
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K, the porosity (f>, the capillary pressure curve Pc(Sw), and the diffusion and 
dispersion coefficients are effective values that must be obtained from local 
properties via scaling techniques. In addition, the inaccessibility of the re- 
servoir to measurement of even the local properties increases the difficulties. 
See pni E3 ESI E31 El and the references contained therein for a survey 
of parameter estimation and history-matching techniques which have been 
applied. 

Even if complete information is known about the reservoir properties in 
a highly heterogeneous reservoir, the problem of how to represent this reser- 
voir on coarse-grid blocks of different length scales still remains. The power 
of supercomputers must be brought to bear for simulation studies using ho- 
mogenization and statistical averaging to represent fine-scale phenomena on 
coarser grids. 

Most of the papers in this book deal with the model equations presented 
in this section; other papers handle slightly different flow equations such as 
the elastic porous medium model by Chen-Ewing-Lyons-Qin-Sun-Yale, the 
non-Darcy well models by Garanzha-Konshin-Lyons-Papavassiliou-Qin, the 
lattice Boltzmann models for energy equations by Y. Li, the non-Newtonian 
flow by Ming-Shi, the omega equations by Villatoro-Garcia-Lafuente, and the 
Volterra integral equations of the first kind for the determination of capillary 
pressure functions by Subbey-Nordtvedt and Wang-Xiao. The mathematical 
and numerical techniques developed here can be possibly extended to more 
complicated model equations |21l ESI E3|. Error analysis for the numerical 
methods described in this paper for equation (3.4) has been carried out in 

ESlESlEn]. 

4 Mathematical Results 

The differential equations presented in the previous section have been studied 
in the past few decades. Existence of weak solutions to these equations 
under the assumption that the fluids are incompressible has been established 
D 0 □ moouil ES EOl EH Recently, uniqueness and regularity of the weak 
solutions has been studied EH The degeneracy and strong coupling of 
these differential equations makes it very hard to study them. In particular, 
these mathematical properties for compressible fluids have not been obtained 
yet. Also, due to the degeneracy and strong coupling, the solutions do not 
have much regularity EOIET]. Any attempt to obtain error estimates for the 
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numerical methods described in this paper and other papers in this book for 
these differential equations has to respect the minimum regularity m- 

5 Mixed Methods 

for Accurate Velocity Approximations 

There are two major sources of error in the methods currently being utilized 
for finite difference discretizations of equation (3.4). The first occurs in the 
approximation of the fluid pressure and velocity. The second comes from 
the techniques for upstream weighting to stabilize the saturation equation in 
(3.4). In this section, we describe the mixed finite element method for the 
accurate approximation of the total velocity v. Some alternate upstream- 
weighting techniques developed from a finite element context were presented 
in Ewing et al. m- 

Among the disadvantages of the conforming discretizations are the lack of 
local mass conservation of the numerical model and some difficulties in com- 
puting the phase velocities needed in the transport and saturation equations. 
The straightforward numerical differentiation is far from being justifiable in 
problems formulated in a highly heterogeneous reservoir with complex geo- 
metry. On the other hand, the mixed finite element method in offers an 
attractive alternative. In fact, this method conserves mass cell by cell and 
produces a direct approximation of the two variables of interest-pressure and 
velocity. Below we explain briefly the mixed finite element method for the 
pressure equation. 

To describe the mixed method we introduce two Hilbert spaces 

W = L^(fi), V={cpe L^(0)^ V • tp e . 

The pressure equation is written in the following mixed weak form: Find 
(p, v) G IT X V such that 

(Av,ip) - (p,V ■ ip) = (G^,ip) VipeV,t>0, 

dp ( 5 - 1 ) 

(C(p,5a)^,V') + (V-v,V') = (/(p,5J,V') VV'SIT, t>0, 

with p(0) G L^{Tt) being the given initial pressure, where C{p,Sa) = SaCa 
and A = (KA)“^. We note that A is always symmetric and positive definite 
which leads to a well defined problem. This is in contrast to system (3.3) 
where the relative permeability kra vanishes when the phase a is absent in 



Chen and Ewing 



some subregion of f2. We note that if there were nonhomogeneous boundary 
conditions on dfl they should be added to the right-hand side (/(p, 5^,), V')- 
Corresponding changes in the bilinear forms in the left-hand side should be 
introduced in the case of Robin boundary conditions. Obviously, equation 
(5.1) forms a nonlinear problem. To solve it, one can use the Picard linea- 
rization (see, e.g., CHI) or any other feasible approach. 

We triangulate the domain 12 into elements, say, simplexes, rectangular 
parallelepipeds, and/or prisms, with characteristic diameter h. Let Wh C W 
and Vh C V be the Raviart-Thomas-Nedelec jZSl EH] , the Brezzi-Douglas- 
Fortin-Marini C21, the Brezzi-Douglas-Marini C31, the Brezzi-Douglas-Duran- 
Fortin nn, or the Chen-Douglas m mixed finite element space associated 
with the triangulation and time discretization = nAt, n = 0, 1, . . .. The 
mixed finite element solution (P", V") G Wh x V?, satisfies 

(A" V", ^h) - (V • Vh. P^) = {Gl, Vh) V G Vh, 

with G Wh expressed through given initial datum. 

This is an implicit in time Euler approximation of a nonlinear problem 
which can be solved by Picard or Newton iterations. Obviously, one can 
easily formulate the Crank-Nicolson scheme. 

The resulting system of linear equations has the form of a saddle point 
problem defined on a pair of finite dimensional spaces Wh and V h- 

A 

where F GVh and G G Wh are given and P" G Wh and V" &Vh represent 
the unknown approximate solution on the time level 2„. Here A : V/i i— >■ V?i is 
a linear symmetric and positive definite operator, the linear map B^ : Wh > 
V h is the adjoint of B : Wh, and D : Wt >->■ Wh is either (l/At)M 

with M similar to the mass matrix in Wh for time dependent problems or 
0 for steady state problems. The existence and uniqueness of a solution is 
guaranteed by the fact that the pair of spaces {Wh,Vh) satisfies the inf-sup 
condition of Babuska-Brezzi EH. 

This is an indefinite system with a large number of unknowns. Such 
type of systems is more difficult to solve compared with the definite systems. 
However, the popularity of the mixed method has increased considerably as 
a consequence of the progress made in recent years in developing efficient 
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algorithms for solving this indefinite system (see, e.g., El 0 Uni CS EH E3 
El Ea EH ESI rTTjV The mixed method for approximating accurately the 
total velocity v is used in this paper. This method is further examined 
and exploited in the papers by Arbogast, Douglas-Pereira-Yeh, Garanzha- 
Konshin-Lyons-Papavassiliou-Qin, Huang-Spagnuolo, Ming-Shi, Qin-Wang- 
Ewing-Espedal, Wang-Li, Russell, D. Yang, and Yuan in this book. 



6 Characteristics-Based Techniques 

In multiphase or multicomponent flow models, the convective, hyperbolic 
part is a linear function of the velocity. An operator-splitting technique has 
been developed to solve the purely hyperbolic part by time stepping along the 
associated characteristics EIEIESIIZHI We first obtain the non-divergence 
form of equation (3.2) with 0 = 1 by using the product rule for differentiation 
on the V • (uc) term and applying equation (3.1) to obtain 



dt 



— -I- u • Vc — V • (DVc) = q{c — c) . 



( 6 . 1 ) 



Next, the first and second terms in equation (6.1) are combined to form a 
directional derivative along what would be the characteristics for the equation 
if the tensor D were zero. The resulting equation is 



V • (DVc) -I- q{c — c) = 



, dc 

^t 



u • Vc = 



dc 

'dr 



( 6 . 2 ) 



The system obtained by modifying equations (3.1) and (3.2) in this way is 
solved sequentially. An approximation for u is first obtained at time level 
t = from a solution of equation (3.1) with the fiuid viscosity /i evaluated 
via some mixing rule at time level Equation (3.1) can be solved by 

the mixed finite element method for a more accurate fiuid velocity as in the 
last section. Let C"{x) and U"(a;) denote the approximations of c{x,t) and 
u{x,t), respectively, at time level t = t". The directional derivative is then 
discretized along the “characteristic” mentioned above as 



.9c, ^ ,C"(c)-C"-i(x"-i) 



(6.3) 



where a:" ^ is defined for an x as 



V^x)At _ 



(6.4) 
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This technique is a discretization back along the “characteristic” generated by 
the first-order derivatives from equation (6.2). Equations (6.3) and (6.4) are 
useful if the characteristics do not change much in each time step. In general, 
several “micro” steps may be necessary to trace accurately the characteristic 
back through a full time step. Although the advection-dominance in the ori- 
ginal equation (6.2) makes it non-self-adjoint, the form with the directional 
derivative is self-adjoint and discretization techniques for self-adjoint equati- 
ons can be utilized. This modified method of characteristics (MMOC) can be 
combined with either finite difference or finite element spatial discretizations. 

In multiphase flow, the convective part is nonlinear. A similar opera- 
tor-splitting technique to solve the saturation equation in (3.4) needs redu- 
ced time steps because the pure hyperbolic part may develop shocks. An 
operator-splitting technique has been developed for multiphase flow Eaoa 
E3 ESI III ESI which retains the long time steps in the characteristic solution 
without introducing serious discretization errors. 

When the gravity term is ignored and the porosity of the reservoirs does 
not change with time in equation (3.4), the operator splitting gives the fol- 
lowing set of equations (with S = 3^)' 

cl)^ + 4^i^{S)-yS = cl>^=0, (6.5) 

ot ab dr 

and 

f)Q 

<(.— +V.(b™(5)5)-V-(D(,5)V5) = q(x,t), (6.6) 

with proper initial and boundary conditions. As noted earlier, the saturation 
S is coupled to the pressure/ velocity equations, which are solved by the mixed 
finite element method described in the last section. 

The splitting of the fraction flow function into the two parts f’”(S')-f b(S')S' 
is constructed m such that f"*(S') is linear in the shock region, Q < S < 
S\ < 1, and b(S') = 0 for 5i < 5 < 1. Further, equation (6.5) produces the 
same unique physical solution after a shock has been completely formed as 

— +V.(f’"(5) + b(5)5) = 0, (6.7) 

with an entropy condition imposed. This means that, for a fully developed 
shock, the characteristic solution of equation (6.5) always produces a unique 
solution and, as in the single-phase case, we may use long time steps At 
without loss of accuracy. 
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The solution of equation (6.6) via a variational method leads to the fol- 
lowing Petrov-Galerkin equations: 



where Mh and Nh are the trial and test spaces spanned hy {9 i} and = 

1,2, . . . , N, respectively. given by equation (6.8) is a nonsymmetrical 

bilinear form with spatially-dependent coefficients. 

To obtain equation (6.8), we have used the characteristic solution from 
equation (6.5) to approximate {d/dT)S and the nonlinear coefficients in equa- 
tion (6.6). The nonsymmetry in the bilinear form is caused by the 

nonlinearity of the convective part of the equation, represented by the term 
b(5')5'. This term balances the diffusion forces in the shock region after a 
traveling front has been established. 

Unfortunately, the MMOC techniques described above generally do not 
conserve mass. Also, the proper method for treating boundary conditions in a 
conservative and accurate manner using these techniques is not obvious. Re- 
cently, Celia, Russell, Herrera, and Ewing have devised Eulerian-Lagrangian 
localized adjoint methods (ELLAM) [ESI EE], a set of schemes that are defined 
expressly for conservation of mass properties. 

The ELLAM formulation was motivated by localized adjoint methods HSl 
EH, which are one form of the optimal test function methods EIEZIEHH 
We briefly describe these methods. Let 



denote a partial differential equation in space or space-time, where J is the 
time interval of interest. Integrating against a test function (f>, we obtain the 
weak form 



If we choose test functions </> to satisfy the formal adjoint equation L*cj> = 0 
and (() = 0 on the boundary, except at certain nodes or edges denoted by 
£i, then integration by parts (the divergence theorem in higher dimensions) 
yields 




( 6 . 8 ) 



Lu = /, X GE = Q or (x,t) G E = Q X J 
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Various different test functions can be used to focus upon different types of 
information. Herrera has built an extensive theory around this concept; see 
m for references. The theory is quite general and can deal with situations 
where distributions do not apply, such as when both u and (p are disconti- 
nuous. 

As in the work of Demkowitz and Oden we want to localize these 
test functions to maintain sparse matrices. Certain choices of space-time test 
functions which are useful for linear equations of the form (3.2) have been 
described in 

The ELLAM techniques have been extended to a wide variety of appli- 
cations IHlOHlEZlESEiEniEIlElElEIlEaiHl- Optimal order error 

estimates have been developed for advection isni, advection-diffusion EZ1IH3, 
advection-reaction | in7irainiitiHniK^iRr!j . and advection-diffusion-reaction 
EH EH systems. These techniques are further studied and applied in the pa- 
pers by DeVore-Wang-Liu-Xu, Qin-Wang-Ewing-Espedal, Wang-Al-Lawatia, 
and Wang-Liang-Ewing-Lyons-Qin in this book. 

Recently, in the study of computational geosciences, a new locally con- 
servative Euler-Lagrangian method (LCELM) was introduced by Douglas, 
Pereira, and Yeh This technique is an extension of the characteristic- 
mixed method for transport-dominated diffusion processes introduced by Ar- 
bogast and Wheeler |S(. The extension properly treats nonlinear problems. 
It was shown m that the LCELM technique is superior to the MMOC and 
the modified method of characteristics with adjust advection (MMOCAA) 
m techniques. The LCELM conserves mass locally, the MMOCAA does it 
globally, and the MMOC does not at all. The LCELM technique is further 
considered in the papers by Douglas-Pereira-Yeh and Huang-Spagnuolo in 
this book. Both the LCELM and characteristic-mixed techniques in jS| can 
be formulated in the ELLAM framework. 

Because of its simplicity, the MMOC technique is still popular. It is 
being applied for compressible flow problems in the papers by Cheng- Wang, 
D. Yang, Yu-Wu, Yuan, and Zhao in this book. 

7 Local Grid-Refinement 

and Domain Decomposition Techniques 

Many time-dependent fluid flow problems involve both large-scale processes 
and highly localized phenomena that are often critical to the overall chemi- 
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cal and physical behaviors of the flows. For large-scale applications, it is 
frequently impossible to use a uniform grid which is sufficiently fine to re- 
solve the local phenomena without yielding numbers of unknowns that will 
overburden even the largest of today’s supercomputers. Since these local 
processes are often dynamic, efficient numerical simulation requires the abi- 
lity to perform dynamic self-adaptive local grid refinement. The need for 
adaptive techniques has provided the impetus for the development of local 
grid-refinement software tools, some of which are used in day-to-day applica- 
tions for small- to mid-size problems. Software and engineering tools capable 
of dynamic local grid refinement need to be developed for large-scale, fluid 
flow applications. The adaptive grid-refinement algorithms must also be clo- 
sely matched with the architecture features of the new advanced computers 
to take advantage of possible vector and parallel capabilities. 

Normally, local refinement must be performed if a fluid interface is lo- 
cated within the coarse-grid block to resolve the solution there. A slightly 
different strategy is to make the region of local refinement big enough so that 
we can use the same refinements for several of the large time steps. This local 
patch refinement technique |SJ EH ED C3 has proved to be very effective for 
obtaining local resolution around fixed singular points such as wells in a reser- 
voir. Adaptive grid-refinement techniques utilizing the patch technique have 
been presented in several surveys (see, e.g., The patch technique 

has been incorporated efficiently in existing multiphase industrial reservoir 
simulation codes. Results for the SPE Comparison projects number 1 and 2 
were presented in m- The local refinement was both efficient and effective 
since excellent results were obtained without destroying the efficiency of the 
original codes. 

The difficult problem with these techniques is the communication of the 
solution between the fine and coarse grids. The domain decomposition tech- 
nique described in gives accurate and efficient treatment of the commu- 
nication problem. General domain decomposition techniques are applied and 
analyzed for flow problems in the papers by Alboin-JaffrARoberts-Wang- 
Serres, Bastian-Chen-Ewing-Helmig-Jakobs-Reichenberger, Li-Ma, Ma-Zhu, 
Reme-Oye-Espedal-Fladmark, Sheen, Shi-Xu, D. Yang, and H. Zhang in this 
book. 
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Abstract 

A variety of models are considered: one-phase flow in a porous 
medium, two-phase flow in a porous medium with two rock types, 
and one-phase flow in a porous medium with fractures. For each of 
these models the domain of calculation is divided into subdomains 
corresponding to the physics of the problem. Then it is shown how 
to rewrite the problems as interface problems to use nonoverlapping 
domain decomposition. 



KEYWORDS: porous media flow, domain decomposition 



1 Introduction 

Domain decomposition methods have been studied for the most part as al- 
gebraic tools for solving problems on parallel machines: see UDI for a review 
of these methods. However, in many models of flow in porous media ari- 
sing from environmental problems in the subsurface as well as from reservoir 
simulation, the domain of calculation is naturally divided into subdomains 
corresponding to the physics of the problem. Therefore, it is reasonable 
to construct nonoverlapping domain decomposition methods which can take 
into account the coupling of the physical phenomena taking place in the sub- 
domains. In these methods one rewrites the global problem as a problem 
with unknowns on the subdomain interfaces. 

After presenting the single phase flow in §2, we show in §3 and §4 how it 
applies to more complex problems: two-phase flow in a porous medium with 
two rock types and one-phase flow in a porous medium with fractures. 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 22-^^ 2000. 

(c) Springer-Verlag Berlin Heidelberg 2000 
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T n2 



Figure 1: The domain divided into two subdomains ili and Q2- 

2 Single Phase Darcy Flow 

We consider first the simple case of an incompressible single phase flow in a 
porous medium. The flow is governed by the equations 

divtp = 0, (f = —K'S/p, in n, (2-1) 

where is a polygonal domain. The unknowns 0 and p are the Darcy velocity 
and the fluid pressure and the coefficient K is the absolute permeability which 
may depend on x G fl. To equations (2.1) we add the boundary conditions 

p = Pd on d flu, 0 ' n = Qd ondfl]s[, (2-2) 

where n denotes the outward normal to fl. dfljy is the part of the boundary 
of 12 supporting Dirichlet boundary conditions and 912 jv the part supporting 
Neumann boundary conditions, with 912 = 912 U 912 jv- 

Now we divide the domain 12 into two polygonal subdomains 12i and 122 
and we denote by F the interface between the subdomains: F = 12i fl 122 (see 
Fig. P). 

If we denote by (0i,Pi),i = 1,2, the restriction of the solution of the 
system of equations (2.1) and (2.2) to the subdomain 12^, then we have 

div(^i = 0, 0i = -KiVpi in 12j, 

Pi = Pd on 912i n 912 £), 0 ■ rii = qd 

with the transmission conditions 

Pi = P2, • ni = • n2 

Here is the outward normal to 12^. These conditions express continuity of 
the pressure and mass conservation. 

Let us now discretize the subdomains 12^ with a mesh % of triangles or 
parallelograms. To simplify we will assume that the two subdomain meshes 



on 912i n 912 AT, 



(2.3) 



on F. 



(2.4) 
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are conforming in the sense that their union forms a regular discretization of 
the whole domain ft. We denote by £i the set of edges associated to Ti- 
To approximate the problem we use the Raviart-Thomas mixed finite ele- 
ments of lowest order 0IH For this purpose we introduce the approximation 
spaces 



in this space are uniquely defined by their flux through the edges of £i. The 
Darcy velocity (fii is calculated in this space. Pq denotes the space of constants 
and Mi and N are spaces of piecewise constant functions defined on £ti and 
on r respectively. The pressure pi inside £li is approximated in Mi and 
the pressure on F denoted by A is approximated in N . We will use the same 
notation for the approximating functions as for the solution of the continuous 
problem and we assume now that the boundary data functions pd and qd are 
constant on each interval of the discretized boundary d£l. 

To solve the problem by nonoverlaping domain decomposition techniques 
we follow ideas from to reduce the problem in to an interface problem 
on F. We introduce Dirichlet-to-Neumann operators Si associated to each 
subdomain fli as follows. Given Ai in N we solve the problem 

Find cpi G Xi(qd), Pi G Mi such that 
[ diY0i = 0 , 



Find Ai S iV, i = 1, 2, such that Ai = A 2 , 5i(Ai) -I- 52(A2) = 0. (2.7) 



^i{9) = {'!?€ RTi){£li) \ v ■ fii = g on dfli fl dflN}, 

M, = {g e L\n,) I g,c € Po{C),C GJl}, iV = J] ME)- 



Eer 



Here i?To(rii) denotes the Raviart-Thomas space of lowest order functions 




(2.5) 



Then we define Si by 

5i(Ai) = fli \t . 

Our problem can now be rewritten as the interface problem 



(2.6) 



The first equality corresponds to pressure continuity while the second corre- 
sponds to mass conservation (see equation (2.4)). The Dirichlet-to-Neumann 
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operators Si are affine. Denote by Si the linear part of Si and by Si the 
constant part: Si(Ai) = Si(Xi) + Si. With A = Ai = A 2 , the domain decom- 
position method is reduced to the linear problem 



where F = Si + S 2 - One can now apply a conjugate gradient method to 
calculate A. Preconditionners have been studied. For instance a Neumann- 
Neumann preconditionner has been presented and analyzed in jS|. For the 
case of a decomposition with many subdomains where some do not touch the 
boundary of fl, the balancing domain preconditionner is robust with respect 
to strong variations of the permeability K am 

In the following sections we show how these domain decomposition tech- 
niques apply to a variety of situations. 

3 Two-Phase Flow with Two Rock Types 

3.1 Formulation of the Problem 

We consider two-phase incompressible flow and we assume that the domain fl 
is divided into two subdomains fli, each subdomain corresponding to a rock 
type. This means that not only are the porosity and the absolute permeability 
different in fii and in 122 but the relative permeability and capillary pressure 
curves are also. 

Two-phase flow is formulated in terms of a saturation equation and a 
pressure equation using the global pressure [2|. We assume the flow to be 
incompressible and we neglect gravity. 

The saturation equation expresses volume conservation for the wetting 
phase (which is equivalent to mass conservation since the flow is assumed to 
be incompressible), so inside each subdomain 12^ we have 



where Si = S^i is the saturation of the wetting phase (0 < 5^ < 1 ). Here 
d>i and Ki denote the porosity and the absolute permeability, and ipi is the 
total Darcy velocity, the sum of the Darcy velocities of the wetting and the 
nonwetting phases: 



Find A G iV such that (5i -I- 52)A = F 



( 2 . 8 ) 




(3.1) 



— ^wi H” ^nwi’ 
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The coefficients at and bi depend on the mobilities k^i and knwi and the 
capillary pressure pd which are functions of the saturation: 



h h 



wi 



dpc 



dS ’ 



b,= 



The capillary pressure is pd = Pnwi — Pwi where Pnwi and p^i denote the 
pressures in the nonwetting and wetting phases. 

Plugging the first equation of (3.1) into the second, one obtains for sa- 
turation equation a nonlinear parabolic equation of diffusion-advection type. 
The vector fy, the diffusive contribution to ifwi, is due to capillary effects and 
fi, the advective contribution to pwi, depends on the total Darcy velocity ipi 
which is given by the pressure equation that we now describe. 

The pressure equation expresses the conservation of the total volume of 
the two phases. Since the flow is assumed to be incompressible this takes the 
form 

div(pi = 0, Pi = -Kidi{Si)Vpi, (3.2) 

where the global pressure pi is given by 

Pi — Pnwi) ^i{S). ( 3 - 3 ) 

The coefficients ■ji and di are functions of the saturation S: 






di — kn,i kn 



Continuity of the phase pressures p^i and Pnwi implies that the capillary 
pressure pd, and consequently the saturation Si, is continuous, and that the 
global pressure pi is also continuous (see definition (3.3)). Also, because of 
phase conservation, the normal components of the phase Darcy velocities 
Pwi and Pnwi, and consequently the normal components of the total Darcy 
velocity pi, are continuous across any hypersurface. 

Now we come to the transmission condition across T and we assume here 
for sake of simplicity that the two capillary pressure curves have the same 
endpoints. 

Across the interface T we still have phase conservation and continuity of 
the phase pressures. This latter condition implies that the capillary pres- 
sure is continuous since it is the difference of the phase pressures, and that 
the quantity p — 7 is continuous (see equation (3.3). Thus for the pressure 
equation we have the transmission conditions 



Pi - 7i(5'i) =P2- ' 72 ( 82 ), Pi-fii = p2- fl2. 



(3.4) 
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This implies that in general the global pressure p is discontinuous across T. 
The second equation of (3.4) enforces conservation of the global mass of the 
two phases. 

For the saturation equation the transmission conditions are 

Pci(Si) = Pc 2 (S 2 ), 0wi ■ fii = 0w2 ■ n 2 on r. (3.5) 

Thus the saturation is discontinuous in general; the second equation enforces 
conservation of the mass of the wetting phase. One should note that the first 
equation is a nonlinear transmission condition for the saturation. 



3.2 The Pressure Equation 



To equations (3.2) we add boundary conditions (2.2). Equation (3.4) shows 
that the situation differs now from that in §2 in that the pressure p is dis- 
continuous across T with a given jump. 

We discretize with the same ideas as in §2, and we introduce the Dirichlet- 
to-Neumann operator Si which associates to Xi G N the flow rate pi-rii where 
ifii is the solution of 



Find 0i G Xi{qd), pi G Mi such that 

/ diwpi = 0 , 

Jcii 

/ {Kidi{Si))~'^(pi - V - / Pi divv+ / pd 

J Cli J J dVtifyV ]D 



(3.6) 



V ■ Ui 



+ J + Xi) V ■ Hi = 0, V G Xi{0). 



With this definition of the Dirichlet-to-Neumann operators the problem re- 
duces again to problems (2.7) and (2.8). In this case A represents p — 'y{S). 



3.3 The Saturation Equation 

To equations (3.1) we add the boundary conditions 

S = Sd on dflsD, 0w-H = q^d on dflsN- (3.7) 

Discretizing the saturation equation is more complex than discretizing the 
pressure equation. Indeed it is a nonlinear parabolic equation, often advec- 
tion dominated, with a diffusion term which degenerates when the saturation 
is minimum or maximum. We propose the use of a semi-implicit Euler di- 
scretization in time 0]. When calculating the saturation at the n -I- 1 time 
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level the advection term is lagging in time at the n time level and calculated 
with upstream values of the saturation, while the diffusion term is calculated 
at the n + 1 time level with the nonlinear coefficient a lagging also at the 
previous time level. Thus at each time step one has to solve only a linear 
system to calculate the saturation. 

We assume that the data Sd and q^d are constant on each interval of the 
boundary dfl and the approximation spaces for Si and if^i are the same as 
in the previous sections. Actually since the advective part of the flow fi is 
calculated at the previous time level, the main flow unknowm is the diffusive 
part of the flow fy. 

To simplify the presentation, we assume that the domain is rectangular, 
discretized with rectangles with sides parallel to the x\ and X 2 coordinates 
axes so we can use a cell-centered finite volume method. Also the interface 
r is supposed to be parallel to the X 2 axis as in Fig. E We denote by 
d fliB , d ^iT , d fliL and dfliR the bottom, top, left and right parts of the 
boundary dfli. 

With these assumptions the discretized saturation equation is 



Find fi S Ai(r^), Si S Mi such that 

r on+1 on r 

/ ^ ^ + / Kt = ^ VCS7-, 

Jc Jqq 

Kt = • rfc + F^, 

F^ = 0^-nch{S'^_), 



;»n+l 



cn+1 

°iCEi 

on-K 

°iCE2 

on-K 

°iCE2 

on-K 

^^Cei 



on-t-1 S^~^^ 

■ He \e= -X," VA S A ^ dfisN, 

tlE 

= Sd when E <Z d ft SD f^{dftiB^ dfliL), 

= Sd when E C dflsDf^{dfliT'SdfliEi), 

= Ai when E gT, i = 1, 

= A 2 when E gT, i = 2, 



(3.8) 



with the transmission conditions 



Pci(Ai) = Pe2(A2), = 0 on F. (3.9) 

We used the notation: He is the outward normal to dC, He the normal to 
the edge E pointing in the positive x\ direction if E is vertical or pointing in 
the positive X 2 direction if E is horizontal, Sf_ is the saturation value which 
is upstream with respect to 0^, and — F". When E is an interior 
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edge we denote by Cei and Ce2 the two cells adjacent to E, denotes the 
harmonic average of K in these two cells, S^e denotes the standard average 
of and and He is equal to the space discretization step. When 

if is a boundary edge is just the value of K in the neighbouring edge 
and He is equal to half the space discretization step. 

To rewrite problem (3.8) and (3.9) as an interface problem we proceed 
as before and we introduce for each subdomain the linear Dirichlet-to- 
Neumann operator Si defined as 

5,(A,) = (r7+i-n. + Ff) |r, 

where is calculated by solving equations (3.8) inside each subdomain 
fli with a given on T. 

Our problem can now be rewritten as the interface problem 
Find Xi € N,i= 1,2, such that 

(3.10) 

Pcl(Al) = Pc2(A2), 5i(Ai) + 52(A2) = 0. 

One observes that the first equation is nonlinear and that the second equation 
implies that r • n is discontinuous across F. One way to solve problem (3.10) 
is to use incomplete Newton iterations with a preconditionned GMRES. 
Remark. It is possible to make the diffusion term fully implicit. This would 
lead to use a nonlinear Dirichlet-to-Neumann operator. 

3.4 A Numerical Example 

As an example we consider the displacement of a nonwetting fluid by a wet- 
ting fluid with a mobility 10 times larger. The domain of calculation has 
two regions with different rock types, that on the left having an absolute 
permeability 5 times larger than that on the right. The injection is parallel 
to the interface F between the two rock types (see Fig. n. 

The capillary functions are given by the standard formula 

P,{S) = J{S) 

and are shown in Fig. 0 Since K is larger in fli than in fl2, the capillary 
pressure is smaller in fli than in fl 2 , so at the interface we can expect a 
discontinuous saturation smaller in fli. Figure 0 shows numerical results at 
a certain time. The picture on the right shows the total Darcy velocity field 
ifi at a certain time. Since the injection rate is constant along the bottom 




30 



Alboin, Jaffre, Roberts, Wang, and Serres 




Figure 2: 



An example of a displacement in a medium with two rock types. 



Pc 




Figure 3: Capillary pressure curves, pd lower and pc 2 upper. 



boundary and the permeability is low on the right the Darcy velocity in fl 2 
turns to the left in the vicinity of the injection boundary. The picture on 
the left shows the saturation of the injected fluid at the same time. One 
can observe the discontinuity at the interface. Note that along this interface, 
where the saturation is not equal to zero, the saturation is smaller on the left 
than on the right because the capillary pressure curve is smaller on the left 
than on the right. However further away from this interface it is the opposite: 
the saturation, as the absolute permeability, is larger on the left than on the 
right. 
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Figure 4: Calculated total Darcy velocity (left) and saturation (right). 



4 Domain Decomposition with Fractures 

4.1 Formulation of the Problem 

In this section the domain fl is divided into two subdomains Hi, f = 1, 2, by a 
fracture F which is also a porous medium but with higher permeability. This 
fracture is assumed to have a width small compared to the size of the whole 
domain, so in the numerical model it is modelled as the interface F between 
the subdomains. 

Here we are interested in understanding the interaction between the flow 
in the subdomains and the flow in the fracture. We assume that the flow 
in the subdomains as well as in the fracture is governed by Darcy’s law, is 
incompressible and we neglect gravity. 

Interaction between the fracture and the subdomains is assumed to sa- 
tisfy mass conservation and pressure continuity. Thus we consider the set of 
equations: in the subdomains: 

div((^i) =0, (fi = -KiVpi in 

Pi = Pf on dfli n F, Pi = Pd on dfli fl dflu, pi ■ Hi = pd on dfli fl dflpf, 
and in the fracture: 

^ ^ ^ _ dpf ^ 

- — = ipi-m + ip 2 - ri 2 , Pf = -cTf Kf - — on F, 

oxf oxf (4.1) 

Pf=Pf,d ondTo, Pf = Qfd ondT^. 
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Here Kf, aj, pf and ipf denote the permeability, the width, the pressure 
and the flow rate in the fracture, and djdxf denotes the derivative along the 
fracture. 

At the extremities of the fracture there are Dirichlet (on SFd) or Neu- 
mann (on OTn) boundary conditions. If touches didi fl didi) then pres- 
sure continuity implies that the pressure data must satisfy pd = Pfd- 

The first equation in (4.1) expresses mass conservation for the flow in 
the fracture. The righthand side in this equation is the contribution of the 
subdomain flow to the fracture flow. 

We proceed now as in §2, defining the same Dirichlet-to-Neumann opera- 
tors (2.5) and (2.6) and using the same notation. Moreover on F the pressure 
Pf is approximated in a space Mfoi functions constant on each interval, while 
the flow rate Pf is approximated in Xf(qfd) a space of continuous piecewise 
linear functions which are equal to qfd on 9Fjv. 

This results in the interface problem 

Find (pf G Xf{qfd), Pf G N such that 

J^S,(p,) + S,{p,) + ^=0. 

Kfy^ipfV- J^Pf-^ + {pfdv) larj, =0 WvGXf{qo). 

Comparing with problem (2.8) we observe that the problem to be solved here 
is a global equation on the interface F. 

4.2 A Numerical Experiment 

To illustrate the model we consider an ideal dimensionless problem. The 
domain is an horizontal rectangular slice of porous medium, of dimensions 
2x1, with a given pressure on the left and right boundaries and no flow 
conditions on the top and bottom boundaries. In the domain the permeability 
is equal to one. The domain is divided into two equally large sub-domains by 
a linear fracture parallel to the X 2 axis. In the fracture we chose ct/ Kf = 2. 
For example the fracture could be of width 0.1 and could have a permeability 
equal to 20. 

Two cases are considered. A symmetric case where pressures on the left 
and on the right boundaries of the domain are equal. So the flow is driven 
only by the fracture and is symmetric. In the other case there is a pressure 
drop from the right boundary to the left one. Then the flow is a combination 
of the flow in the fracture and that going from left to right in the rest of 
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Figure 5: Calculated Darcy’s velocity for a symmetric and a nonsymmetric flow 
pattern 

the porous medium. Flow in the fracture is driven by a pressure drop of 10 
between the two extremities of the fracture for the first case and a pressure 
drop of 5 for the second case. 

Numerical results are shown on Fig. 0 Arrows represent the flow field 
with length proportional to the magnitude of the velocity. The gray scale 
represents the magnitude of the velocity with the lightest color corresponding 
to the largest velocity. We see that there is actual flow interaction between 
the fracture and the rest of the porous medium. In particular one can observe 
that some fluid is coming out of the fracture and then is coming back into it. 
In the nonsymmetric case we notice also that even though most of the flow 
is attracted into the fracture, there is still some flow on the left part of the 
domain pointing toward the left. 

5 Conclusions 

Studying a few examples of flow in porous media, we met a variety of trans- 
mission conditions which are nonstandard: discontinuity of the scalar varia- 
ble (pressure or saturation) , discontinuity of the flow rate variable (capillary 
flow), nonlinear transmission condition (saturation), nonlocal transmission 
condition (porous medium with fractures). In spite of this variety of situati- 
ons we showed that domain decomposition techniques based on Dirichlet-to- 
Neumann operators can be used to set these problems as interface problems 
to be solved using domain decomposition algorithms. 



References 

[1] Brezzi, F. and Fortin, M., Mixed and Hybrid Finite Element Methods, 
Springer Verlag, Berlin, 1991. 





34 



Alboin, Jaffre, Roberts, Wang, and Serves 



[2] Chavent, G. and Jaffre, J., Mathematical Models and Finite Elements 
for Reservoir Simulation, volume 17 of Studies in Mathematics and its 
Applications. North Holland, Amsterdam, Amsterdam, 1986. 

[3] Cowsar, L., Mandel, J., and Wheeler, M., Balancing domain decompo- 
sition for mixed finite elements. Math, of Comp. 64 (1993), 989-1015. 

[4] Douglas, J., Jr. and Dupont, T., Galerkin methods for parabolic equa- 
tions, SINUM7 (1970), 575-626. 

[5] Glowinski, R. and Wheeler, M., Domain decomposition and mixed finite 
element methods for elliptic problems, in Glowinski, R. et ah, editor. 
Proceedings of the First Symposium on Domain Decomposition Methods 
for PDFs, SIAM, Philadelphia, 1987, 144-172. 

[6] Le Tallec, P., De Roeck, Y.-H., and Vidrascu, M., Domain decomposition 
methods for large linearly elliptic three dimensional problems, J. Comp. 
Appl. Math. 34 (1991). 341-362. 

[7] Mandel, J., Balancing domain decomposition. Comm, in Numerical 
Methods in Engineering 9 (1993), 233-241. 

[8] Raviart, P.-A. and Thomas, J.-M., A mixed finite element method me- 
thod for second order elliptic problems, in I. Galligani and E. Magenes, 
editors. Mathematical Aspects of Finite Element Methods; Lecture No- 
tes in Mathematics 606, Springer, Berlin, 1977, 292-315. 

[9] Roberts, J. E. and Thomas, J.-M., Mixed and hybrid methods, in P.G. 
Giarlet and J.L. Lions, editors. Handbook of Numerical Analysis Vol.H, 
North Holland, Amsterdam, 1991, 523-639. 

[10] Smith, B., Bjorstadt, P., and Gropp, W., Domain Decomposition: Par- 
allel Multilevel Methods for Elliptic Partial Differential Equations, Gam- 
bridge University Press, 1996. 




Numerical Subgrid Upscaling 
of Two-Phase Flow in Porous Media 

Todd Arbogast 



Abstract 

We present an approach and numerical results for scaling up fine 
grid information to coarse scales in an approximation to a nonlinear 
parabolic system governing two-phase flow in porous media. The tech- 
nique allows upscaling of the usual parameters porosity and relative 
and absolute permeabilities, and also the location of wells and capil- 
lary pressure. Some of these are critical nonlinear terms that need 
to be resolved on the fine scale, or serious errors will result. Upsca- 
ling is achieved by explicitly decomposing the differential system into 
a coarse-grid-scale operator coupled to a subgrid-scale operator, which 
we localize by imposing a closure assumption. We approximate the 
coarse-grid-scale operator with a mixed finite element method that 
has a second order accurate velocity coupled implicitly to the subgrid 
scale. The subgrid-scale operator is approximated locally by a first 
order accurate mixed method. A numerical Greens influence function 
technique allows us to solve these subgrid problems independently of 
the coarse-grid approximation. No explicit macroscopic coefficients 
nor pseudo- functions result. The method is easily seen to be optimally 
convergent in the case of a single linear parabolic equation. 



KEYWORDS: upscaling, subgrid, numerical Greens functions, porous media 

1 Introduction 

In many physical problems, there are scales that are too fine to resolve on 
any reasonable computational mesh. The objective of upscaling or homoge- 
nization is to replace the governing equations by a simpler set of equations 
for which the solution can be resolved on a reasonable coarse-scale mesh and 
approximates the average behavior of the solution of the governing equations. 
In its simplest form, one replaces the coefficients of the governing equations 
with an effective or macroscopic coefficient 0. This works well in certain 
situations 0IH, but not so well in others. Often it is necessary to change the 
form of the governing equations to obtain a suitable coarse-scale model |^; 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 35-^^ 2000. 

(c) Springer-Verlag Berlin Heidelberg 2000 
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there is no a priori reason to expect otherwise. Such is especially the case 
when nonlinearities are present, since it is well known that a function of an 
average is not the average of the function. Various techniques are used in 
this case, including homogenization miiii!, the definition of pseudo-functions 
(altered forms of the nonlinear functions that appear in the governing equa- 
tions), the use of renormalization methods jH|, and many other techniques. 

In terms of the simulation of flow and transport in a porous medium, 
our goal for nonlinear upscaling in this paper will be to resolve some of the 
finer scales in the solution directly, so that no loss of accuracy due solely to 
averaging will result. We will then be able to incorporate directly into the 
simulation relative and absolute permeability, porosity, capillary pressure, 
and well location information on scales smaller than the computational grid. 
That is, our nonlinear functions such as relative permeability and capillary 
pressure need not be modified, since the fine scales have been sufficiently re- 
solved. Our technique is based on numerics. We assume that a fine grid fully 
represents the important physical scales, and that our computational grid is 
somewhat coarser. That is, perhaps some other homogenization technique 
has elevated the problem to a reasonable fine scale, but this fine scale is still 
too fine to compute over. 

We present our ideas by considering first a problem representing incom- 
pressible, single-phase flow in a porous medium in the absence of gravity: 

V-u = /, u=-KVp, (1.1) 

where p is the pressure, K is the permeability divided by the fluid viscosity, u 
is the Darcy velocity, and / represents the wells. For simplicity, set u • = 0 

on the external boundary. 

As an outline of the paper, we present in §2 a derivation of the upscaled 
equations for single phase flow, including a definition of our closure assump- 
tion. In §3 we give a mixed finite element approximation of the equations 
that is compatible with the closure assumption. A solution technique based 
on the computation of numerical Greens functions is given in §4. The accu- 
racy of the method is discussed in §5. Finally in §6, we present briefly the 
two-phase problem, followed in §7 by some numerical results. 
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2 Derivation of the Upscaled Equations 

We rewrite (1.1) in variational form as: Find p G W = and u G V = 
H{div) such that 

f V ■ uw dx = f f w dx Vw G W, 

(2.1) 

JK ^ u ■ V dx = — f Vp ■ V dx = fpV-vdx VvGV, 

where Jl(div) = {v G : V-v G L‘^,'v-v = 0 on the external boundary}. 

Let Wc and Vc be the computationally resolvable parts of W and V, and 
6W and SV the remainders. That is, 

W = W^®SW, V = V, © (5V, 

p = Pc + Sp G Wc® SW, u = Uc + i5u G Vc © SV. 

2.1 Separation of Scales 

Separate the fine and i5-scales by restricting to appropriate test functions in 
the variational formulation. For the coarse scale, we have 

f V • (uc + (5u) Wcdx = J fwc dx 
f K~^ (uc + (5u) ■ Vcdx = f (pc + Sp) V -Vcdx 
and for the (5-scale, 

V • (uc + (5u) Swdx = J f Sw dx 
f K~^ (uc + (5u) ■ Sv dx = f (pc + Sp) V ■ Svdx 

If we were to ignore the i5-scales (i.e., perform no upscaling), then we would 
simply set (5u = 0, = 0, and use only the coarse equation (2.2). Upscaling 

concerns the treatment of these other terms and (2.3). 

Given (uc,Pc, /), we can solve for 

<5u = $„(Uc,Pc,/) and Sp = ^p(Uc, Pc, f), (2.4) 

where 4> is a multi-linear operator. Thus (2.2) becomes 

/ V • (Uc + ^u(Uc,Pc,f))Wcdx = J f Wcdx Vwc G Wc, 

/iF-i(Uc + 4>ti(Uc,Pc,/))-Vcdx (2-5) 

= J{Pc + ^p{Uc,Pc,f))V - Vcdx VVcGVc, 

posed only on the coarse scale. We remark that no approximation has been 
made yet; all scales are fully resolved by (2.3)-(2.5). However, these two 
equations are intrinsically coupled, since <I> is a nonlocal operator. 



Vwc G Wc, 
Vvc G Vc, 



( 2 . 2 ) 



y Sw G SW, 
V(5v G SV. 



(2.3) 
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2.2 Closure Assumption (Localization Approximation) 

Define a coarse computational grid and assume that 

SY • z/ = 0 on dEc (2-6) 

for each coarse element E^- Then (i.e., (2.3)) becomes a local operator: 

V • (uc + i5u) Jw da; = f Sw dx ^ 5w £ 5 W\e^, 

/g K~^ (uc + du) ■ 5'v dx = /g (pc + 5p) V ■ Svdx Vdv G dVI^^,. 

(2.7) 

Condition (2.6) is our closure assumption. We have assumed that all net flux 
between coarse elements occurs only on the coarse scale. 

3 Mixed Finite Element Approximation 

We assume that nested fine and coarse computational grids are used, and 
let h and El be the grid spacings, respectively. The fine grid is assumed to 
be essentially what is needed to fully resolve the physical scales. Generally 
speaking, we envision El/h as a moderate integer (4 to 10, say). 

We approximate (2.4)-(2.7) by a mixed finite element method. Other 
discretizations could be employed; however, the local conservation of these 
methods make them attractive for porous media simulation They ap- 
proximate both the pressure and Darcy velocity directly. 




L-® ® — 

Coarse BDDFi velocities 





c > 






c > 




; 


c > 





d-scale RTq velocities 



Figure 1: The degrees of freedom of the approximating spaces. 



As depicted in Fig. 1, we approximate the coarse equation (2.5) on the 
coarse grid with BDDFi spaces (BDMi in 2-D) 0 0 These spaces have 
1 degree of freedom per coarse element for the pressure approximation, Wc, 
and 3 degrees of freedom per coarse element face (2 per edge in 2-D) for 
the velocity, Vc. They are second order accurate in H for the velocity and 
first order for the pressure. The fine grid is used for the (5-equation (2.7). 
We use within each coarse element RTq spaces m- These have one degree 
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of freedom per fine element for the pressure approximation, 5W , subject 
to the requirement of orthogonality to Wc that the average over the coarse 
element vanishes, and RTq has one degree of freedom per fine element face 
for the velocity, <5V, subject to the closure assumption (2.6). They are first 
order accurate in h for both the pressure and velocity for pressures with zero 
average over coarse elements and velocities with zero normal components on 
coarse element boundaries. 

We remark that pressure is resolved fully on the fine scale, and formally 
approximated to first order in h. The fact that the BDDFi space approxima- 
tes velocity to second order compensates for the closure assumption, which 
assumes all net flow between coarse elements is on the coarse scale. Without 
this choice, the results degrade significantly 

4 Solution by Numerical Greens Functions 

We present now a technique to solve the system of equations efficiently. Before 
we elaborate, the outline of the technique is as follows. 

1. Pre-solve for the influence of the coarse scale on the 5-scale. These are 
small disjoint problems, one for each coarse element, by (2.6). These 
pre-solutions are numerical Greens functions for the 5-problems (2.7). 

2. Solve the coarse scale problem (2.5), accounting for the pre-solution 
response of the 5-scale to the coarse scale in (2.4). 

3. Post-solve to combine results to form the fine-scale solution. 

Integrals over coarse elements of the form Wc*5W vanish by orthogonality. 
Since V • Vc = Wc and V • 5V = 5W , integrals of V • 5V + Wc and V-Vc* 5W 
also vanish; thus, several terms in the equations below vanish. 

4.1 Pre-solution 

Locally on each coarse element Ec, let Vg ^ G have flux only at a single 
degree of freedom {i = 1, ..., 18 in 3-D and i= I , ..., 8 in 2-D). Then 




(4.1) 



Solve the following problems for the numerical Greens functions. 
Nonhomogeneous terms. Find 5 uq G 5V and 5po G 51T such that 



f V ■ SugSwdx = f fSwdx M 5w & 5 W\e^, 

f K~^Suq ■ Svdx = f SpoV ■ Svdx V5v G 5V|£;_,. 



(4.2) 
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Influence ofvc^i- For each i, find 5uj G <5V and 6pi G 6W such that 
fV-SuiSwdx = 0 ^ 5w € 5 W\e^, 

f K~^(vc,i + 5ui) ■ 5v dx = J SpiV ■ Svdx V Sv G <5V|£;^. 

Note that the combinations 



(4.4) 



5uo + J2t = ^«(Uc, /) = (5u, 

+ J2i ^Pi = ^p(Uc, /) = Sp 
depend linearly on the (as yet unknown) nodal values of Uc and on the nu- 
merical Greens functions and give i5u and Sp solving (2.7). 



4.2 Coarse Solution 



Given the numerical Greens functions and the implicit representation of the 
upscaling operator (4.4), we can now reformulate (2.5) as a problem for the 
course unknowns only. We find Uc G Vc and Pc G Wc such that 

/v-w Wcdx = J f Wcdx \/Wc G Wc, 

f K~^(Uc + 4>„(Uc,/)) -Vcdx = JpcV -Vcdx VVc G Vc. 

We rewrite this system with Vc = Vj using that 

u = Uc -I- (5u = (5uo -I- C(i (vc,* + <^w), 

p = Pc + Sp = pc + Spo + J2i 



(4.5) 



(4.6) 



and using (4.2) and (4.3) with 6v = i5uj and orthogonality as 
/ ^ ■ (yc,i + iJui) Wcdx = J fwc dx, 
cti f K~^(vc,i + Sui) ■ (vcj + Suj) dx 

= J(Pc + 5po) V • (vcj -I- i5uj) dx — J K~^6uq ■ (vcj -I- i5uj) dx. 

Thus the method is similar to an “optimal test function” method where we 
replace Vc by Vcj + i5uj; however, we also add some nonhomogeneous terms 
that improve the accuracy over such “optimal” methods. 



4.3 Post-Solution 

Given Uc, Pc, and the numerical Greens functions, compute (4.6) on the fine 
scale to obtain a “fully resolved” approximation of the true solution. 



5 Accuracy 

Denote by || • || the L^-norm, and by FVc> Pwf, and Psw the L^-projections 
into Wc, the full fine grid space Wf = Wc © 5W, and 5W, respectively. 
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If we solve the entire problem (2.1) for (uf,pf) G RTq over the entire fine 
mesh or for (ucPc) G BDDFi (or BDMi in 2-D) on the coarse mesh, we see 
the following error estimates dElEI, where (u,p) is the true solution. 

Theorem 5.1 For RTq with no upscaling on the fine mesh, for v G Vj such 
that V • V = Pwf f, 

\\K-^/\u-Uf)\\ <M\\K-^/^{u-v)\\ <Ch, \\p-pf\\<Ch. 

For BDDFi (or BDMi) with no upscaling on the coarse mesh, for v G 
such that V • V = Pwcf j 

||X-i/2(u-u,)|| <inf||iF-i/2(u-v)|| \\p-p,\\<CH. 

V 

The upscaling technique displays elements of both estimates above. It is 
easy to prove the following error estimate. 

Theorem 5.2 For BDDFi (or BDMi) upscaled with the RTq subgrid ap- 
proximation, for V G Vc + JV such that V • v = Pwff, '^c € Vc such that 
V • Vc = Pwcfj ond Sv G JV such that V ■ Sv = Pswf, 

- (uc + ^u))|| < infv ||RT“^/^(u - v)|| 

< infQ,v„5v {||r:-i/2(u- V c)|| + Q_ ^v)||} , 

lb- Pell < CH. 

While it is difficult to interpret the velocity error, we should expect better 
bounds than in Theorem 15 . 1 1 that is, we should expect the velocity error to 
be second order. Our numerical results suggest that this is indeed the case, 
and that the error has in fact no simple form (as indicated in the theorem). 
We consider two test cases in which K = 1 on a unit square domain and 
we use Dirichlet boundary conditions and / defined from the imposed true 
solution p{x,y) = xy^ + x^ycos{xy) or 1/(1 + exp(10x + lOy^ — 3y — 5)), 
respectively. 

It is readily apparent from the data in Table 1, that if F[/h is fixed, the 
error in pressure is 0{H) = 0{h), and the error in velocity is 0{H^) = 
0{hf), as we would expect. However, if H/h is not fixed, the results are 
much less predictable. For the tests reported and a few more conducted, 
the best fit of the L^-error in u is if/ = 150if^ + and if/ = 

lOOOii^ + SOOOiF^/'^h’^/'*, respectively for the two cases, and for the pressure 
p, the error is E], = 0.36h + 0.0003i? and El = QAh + 0.0006iJi'^h-° b 
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respectively. Thus the error depends in a complicated way on the solution 
and probably on H/h, but p is first order and u is second order accurate. 

It is interesting to note from Table 1 that the error in the pressure is 
dominated by the fine mesh size, nearly independently of the coarse mesh. 



l/h 


l/H 


Case 1 
Pressure 


Velocity 


l/h 


l/H 


Case 2 
Pressure 


Velocity 


10 


10 


0.0359 


5.63 


10 


10 


0.0438 


173.64 


20 


20 


0.0180 


1.42 


20 


20 


0.0219 


47.08 


40 


40 


0.0090 


0.36 


40 


40 


0.0109 


12.03 


80 


80 


0.0045 


0.16 


80 


80 


0.0055 


3.03 


10 


2 


0.0359 


46.79 


10 


2 


0.0440 


413.35 


20 


4 


0.0180 


11.14 


20 


4 


0.0220 


185.70 


40 


8 


0.0090 


2.76 


40 


8 


0.0109 


43.25 


80 


16 


0.0045 


0.69 


80 


16 


0.0055 


10.82 


160 


4 


0.0023 


8.97 


80 


2 


0.0095 


273.92 


160 


8 


0.0022 


2.26 


80 


4 


0.0060 


159.92 


160 


16 


0.0022 


0.60 


80 


8 


0.0055 


36.50 


160 


32 


0.0022 


0.18 


80 


16 


0.0055 


10.82 


10 


2 


0.0359 


46.79 


10 


2 


0.0440 


413.35 


20 


2 


0.0180 


41.09 


20 


2 


0.0230 


310.56 


40 


2 


0.0091 


39.39 


40 


2 


0.0134 


281.47 


80 


2 


0.0046 


38.94 


80 


2 


0.0095 


273.92 



Table 1. Some L^-errors for Cases 1 and 2. 



6 Two-Phase Immiscible, Incompressible Flow 

We now turn to a nonlinear problem describing the flow of two immiscible, 
incompressible fluids in a porous medium, such as oil and water. For phase 
j = w,o (i.e., water and oil), let u^, and pj be the phase saturations, 
Darcy velocities, and pressures. Let s = s^; = 1 — So, ^ be the porosity, 
K the absolute permeability, g the gravitational constant, and z the depth. 
The mobilities are related to the relative permeabilities and fluid viscosities 
as \j{s) = krj{s)/pLj and A(s) = Au,(s) + Ao(s), and Pc{s) = Po — Pw is the 
capillary pressure. Conservation of mass of each phase gives the governing 
equations. After reformulation, we obtain the following (see, e.g., @). 
Pressure equation: 

V • u = / = /„, + /o, u = -ATA(s)(Vp- p(s)Vz), 
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where the global pressure and density are 

P = Po- / '/ ^ PM) da- and p[s) = -rj-rPw + -ri-rPo- 

Jo Kv Ks) 

Saturation equation: 

3s 

+ V • = fw(s), = -KVq{s) + 7(u, s), 

where the “complementary” potential and 7 are 
9(s) = - lo da, 

7(u,s) = (p„ - p^)gVz. 

We use sequential time splitting, a backward Euler time discretization, 
and integration-by-parts (3 times) to obtain the following variational form 
and time approximation for At > 0 and time levels t" = nAt. 

Pressure equation: 

f V ■ u’^wdx = f f^w dx, 

f (A(s"~^)K)~^u^ ■ vdx = f p"V ■ vdx + f p(s"~^)Vz ■ v dx. 
Saturation equation: (wherein w € Wf = Wc © 6W) 

Isf ' wdx + fj^^V-7p^wdx + 7(u”, s”p) -nwds 

= Isf fw(s^)wdx, 
f • V dx = f q{s’^) V • V dx, 

where f/'" = —KVq{s'^), u(), = ^/>”+ 7 (u”, s"), and we use one-point upstream 
weighting on the term involving 7. 

As in the single-phase case, we separate the solution into coarse and fine 
scales Vc © <5V or Wc © SW : 

V = Vc + Sv, U = Uc + du, 'tp = 'tp^ + Sip, 
w = Wc + Sw, P = Pc + dp. 

Because the saturation equation is parabolic, it turns out that we do not 
need to decompose s G VE/. 

The pressure equation is linear and independent of the saturation equa- 
tion, given s”“^. We can solve for the upscaled u" and p" as above. 

We linearize the saturation equation with Newton-Raphson, and solve 
for changes in s" and ■0”, given u", using numerical Greens functions as in 
the linear case above. Upstream weighting on the fine scale destroys our 
localization assumption. To circumvent this, we simply use the old Newton 
result for the upstream value when it traces out of a coarse element. 
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7 Some Numerical Examples 

We present two 2-D examples to illustrate our upscaling technique. In both, 
we have a square domain with uniform rectangular grids. The initial water 
saturation is 0.2. An injection well is placed in the lower left corner injecting 
water at a rate of 0.2 m^ /day, and a production well is in the adjacent corner. 
Time steps vary from 1 day initially to 25 days near the end of the simulations. 
The porosity is 0.25, but the permeability is heterogeneous. 



7.1 Example 1 



In this example, we have a 40 meter by 40 meter domain with a 40 x 40 fine 
grid. The base 10 logarithm of the permeability field is shown in Fig. 2. 




Figure 2; The log of the permeability held for Example 1. 





Figure 3;Fine-scale saturation at 100 and 500 days for Example 1. 



As can be seen in Figs. 3-4, the upscaling procedure approximates the 
saturation quite well. Here we use a 5 x 5 coarse grid, so on each coarse block, 
we have an 8 x 8 subgrid for the J-problems. The coarse solution (Fig. 5) 
completely fails to resolve the flow and location of the wells. 
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Figure 4:5 x 5 Upscaled saturation at 100 and 500 days for Example 1. 





Figure 5:Coarse-scale saturation at 100 and 500 days for Example 1. 



In Figs. 6-7, we show the results of upscaling with 2x2 and 8x8 coarse 
grids (20 x 20 and 5x5 subgrids). Both perform quite well. The coarsest 
example does a very good job near the well, but the performance degrades 
later in time a bit as the flow reaches the middle of the domain. 

The number of degrees of freedom used in these examples is given in 
Table 2. The coarse solution is woefully inadequate; however, for the cost of 
a global problem of the same size, we can upscale to a very reasonable level 
of resolution. 





Coarse grid 




2x2 


5x5 


8x8 


Coarse Velocity 


8 


80 


224 


Upscale Velocity 


3048 


2880 


2784 


Fine Velocity 


6240 


6240 


6240 


Coarse Pressure 


4 


25 


64 


Upscale Pressure 


1600 


1600 


1600 


Fine Pressure 


1600 


1600 


1600 



Table 2. Number of degrees of freedom for Example 1. 
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Figure 6:2 x 2 Upscaled saturation at 100 and 500 days for Example 1. 




Figure 7:8 x 8 Upscaled saturation at 100 and 500 days for Example 1. 

7.2 Example 2 

In the second example, we have a 24 x 24 meter domain with a 24 x 24 fine 
grid and a 4 x 4 coarse grid. The base 10 logarithm of the permeability is 
depicted in Fig. 8. It has two high permeability streaks, akin to fractures. 




Figure 8:The log of the permeability field for Example 2. 
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Fig. 9 shows that the saturation is very difficult to resolve. The upscaling 
technique does a reasonable job following the flow into the first high per- 
meability streak. The coarse solution in Fig 10, however, completely misses 
the high permeability streaks. It shows an overall tendency to flow right to 
left rather than the proper direction down to up. The number of degrees of 
freedom used in this example is given in Table 3. 




Figure 9;Fine-scale and 4x4 Upscaled saturation at 20 days for Example 2. 




X 



Figure 10;Coarse-scale saturation at 20 days for Example 2. 





Velocity 


Pressure 


Coarse 


48 


16 


Upscale 


1008 


576 


Fine 


2208 


576 



Table 3. Number of degrees of freedom for Example 2. 

Some timing results are given in Table 4. The pressure equation is solved 
with Jacobi preconditioned conjugate gradients, the saturation equation by 
Jacobi preconditioned orthomin, and the upscaling numerical Greens func- 
tions by a direct solver. The time to compute the 24 x 24 fine scale solution 
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is quite high, since the problem is very poorly conditioned. In contrast, the 
4x4 coarse problem is solved easily. In this example, the upscaled problem 
takes about as long to solve as the coarse problem; computing the numerical 
Greens functions takes little extra time and gives a much improved solution. 



Steps 


Fine 


Upscale 


Coarse 


1 


2:03 


0:10 


0:08 


2 


1:48 


0:09 


0:08 


3-10 


11:32 


1:10 


1:05 


11-20 


13:29 


1:28 


1:21 


21-36 




2:21 


2:13 


37-48 


35:47 


2:02 


1:52 


49-58 




1:42 


1:33 


59-65 


30:26 


1:21 


1:14 


66-72 




1:21 


1:14 


73-76 


19:22 


0:46 


0:42 


77-84 


38:48 


1:34 


1:25 



Table 4. Some timing results for Example 2. 



8 Conclusions 

Our upscaling approach improves the resolution of the computed solution. It 
allows recovery of fine-scale pressure, velocity, and saturation, so it incorpo- 
rates fine-scale information and nonlinearities directly, thereby circumventing 
the need to define pseudo- functions. The technique resolves positions of wells 
within grid blocks, is efficient to compute, has good convergence properties, 
can be applied at each time step of a time dependent problem, and can be 
applied to a nonlinear problem during a Newton linearization step. 
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Abstract 

The simulation of realistic multiphase flow problems in porous me- 
dia requires efficient solution methods and means for handling the com- 
plicated structure of the media. The numerical toolbox UG has been 
developed to be able to use the approaches-multigrid, unstructured 
grids, adaptivity, and parallel computing-for a wide range of applica- 
tions. An extensive library of discretization schemes has been imple- 
mented with UG in the MUFTE project. In this paper we describe 
a finite volume method for two-phase flow in fractured porous media 
and present results for the parallel multigrid solution of a gas- water air 
sparging problem. 



KEYWORDS: multiphase flow, fractures, numerical toolbox, unstructured 
grids, multigrid, parallel computing 

1 Introduction 

The simulation of realistic problems from environmental engineering, oil re- 
servoirs, and waste disposal sites requires the combination of different nume- 
rical methods. To overcome the difficulties associated with the complicated 
structure of porous media and the highly nonlinear behavior of multiphase 
systems, a combination of unstructured grids, adaptive local grid refinement, 
multigrid methods, and parallel computing is necessary. However, the com- 
plexity of a program with such a combination increases with each component 
by several orders of magnitude when compared to a structured sequential 
program. UG |21 provides a toolbox that combines these approaches for 
workstations and parallel computers and paves the way to use them for mul- 
tiphase flow problems. Based on the program MUFTE, several applications 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 50-^^ 2000. 

(c) Springer-Verlag Berlin Heidelberg 2000 
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for the simulation of multiphase, multicomponent processes have been imple- 
mented. 

After an overview of a two-phase flow problem, we present a finite volume 
method that is part of MUFTE/UG and its extension for discontinuous and 
fractured porous media. We focus on the simulation of gas-water systems 
in the fractured media and present results for a gas-water simulation on 
parallel computers, gas infiltration into fractured porous media, and gas flow 
in single fractures. Future work will involve applications of MUFTE/UG to 
multiphase, multicomponent flows with mass interchange between phases. 

2 The Numerical Toolbox UG 

The main objective of the UG project is to provide a fundamental framework 
on which complex applications can be built. The basic building blocks of its 
software include geometry representation, mesh generation and its modifica- 
tion, discretization, solvers for linear and nonlinear systems, mesh refinement 
and error estimation, and tools for postprocessing and visualization. Since 
full support in all generality for each of these items is hardly possible within 
the framework of a research project, the UG project focuses on unstructured 
grids, local grid refinement, robust multigrid methods, and parallel compu- 
ting on MIMD-type supercomputers. These concepts are present throughout 
the whole design of UG. The modules of UG are shown in Fig. 0. 



user interface 


user code 




numerical algorithms 


(e.g. MUFTE) 




1 


graphics 


linear algebra 


numerics support 


grid manager 










2D mesh 


gen. 


3D mesh gen. 


domain manager 


output devices 


CHACO 


dynamic distributed data (DDD) 










PPIF 





Figure 0: The modular structure of UG. 



The dynamical distributed data (DDD) layer ^ is responsible for creating 
and maintaining the distributed unstructured mesh data structure and the 
associated vectors and sparse matrices. DDD can migrate object copies of 
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a node to other processors, for example, while automatically maintaining its 
references to neighboring objects and the corresponding interface lists. To 
ensure portability and efficiency, DDD uses the parallel processor interface 
(PPIF), which provides the low-level message passing function based on MPI, 
PVM, and several other vendor specific message passing systems. 

The domain manager handles geometry representation, based on two dif- 
ferent implementations. The first implementation, the “standard domain”, 
is based on a piecewise description of the boundary. Each boundary segment 
is defined by a mapping from a parameter space A C TZ^~^ to TZ‘^, d = 2,3. 
In the second implementation, the “linear geometry model” (LGM), each 
boundary segment is an unstructured triangular mesh in 3D space. It is use- 
ful for complicated domains that do not allow for a parameterization. The 
output devices implement support for graphical user interfaces on XI 1 and 
Macintosh and deliver graphic outputs to files in the postscript and PPM 
format. The “low” module provides the basic function, like file I/O and ti- 
ming. CHACO jS] is a graph partitioning package that is included for load 
balancing of the grid. 

The grid manager is at the heart of UG. It is responsible for the creation 
and modification of the unstructured mesh data structure. For the mesh 
creation process, two grid generators are part of UG; the 3D mesh generator 
was developed by Schoberl m The notion of hierarchical grids is an im- 
portant part of the UG design. It is assumed that the domain geometry is 
simple enough to be stored on each processor and that an initial mesh can 
be generated that is much coarser than the final mesh used to compute the 
solution. The grid hierarchy is also used for the construction of finite element 
spaces used in the multigrid method. To ensure flexibility in the construction 
of the finite element spaces, degrees of freedom can be associated with nodes, 
edges, and element faces (in 3D). Grids can consist of elements of several ty- 
pes, triangles and quadrilaterals in 2D and tetrahedrons, pyramids, prisms, 
and hexahedrons in 3D. Meshes can be locally refined and derefined, based 
on geometrical criteria or through error estimators. 

The graphics module on the top of the grid manager is responsible for 
the visualization of meshes and solutions in 2D and 3D. The visualization 
is parallelized and can be sent to any output device (screen or file). The 
linear algebra implements sparse matrix-vector operations and iterative sol- 
vers. The numerical support includes functions commonly needed in finite 
element and finite volume discretization schemes. 
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Linear and nonlinear solvers and time-stepping schemes are located in 
the numerical method module. Multigrid can be used as a preconditioner 
for Krylow subspace methods (Bi-CGstab, GMRES); several smoothers are 
available for the multigrid. Numerical algorithms are implemented as a set of 
classes which can be used directly or from which the application programmer 
can inherit to add new components or replace existing ones. This ensures 
flexibility and expandability together with the graphical user interface and 
the scripting language which drive UG applications. 

UG has been used as a framework for applications from several fields: 
linear elasticity, elastoplacticity, Navier-Stokes equations, and density driven 
flow. In this paper we focus on its application to multiphase flow, which is 
explained in the next section. 

The multiphase, multicomponent module MUFTE includes several appli- 
cations. The processes which can be implemented via MUFTE may have up 
to three phases and up to three components. The phase transitions have been 
taken into account via Henry’s, Raoult’s, and Dalton’s Law, and miscible flow 
can be thus treated. Nonisothermal processes can also be described, taking 
the energy equation into account. Future work will involve the addition of a 
more general phase behavior package to MUFTE. 

3 Governing Equations for Two-Phase Flow 

We consider the flow of two immiscible fluids in a porous medium U C TZ‘^, 
d = 2,3. Let T = (0,T) be the time interval of interest. We focus on the 
phases water and gas, but the consideration below is also valid for a general 
wetting phase and a non-wetting phase, each consisting of a component. 

Gonservation of mass for both phases a = w,g is expressed by 

(9(4>P„5q,) • o -7- 

f V ■ (gaUaj = QaQa in U X 7 , 

where d>(x) is the porosity, S'q,(x, f) the saturation, Qa{Pa) the density, Pa{^, t) 
the pressure, Uc,(x,t) the macroscopic velocity, and qa(x,t) the source/sink 
term of phase a. The two phases completely fill the void space of the porous 
medium: 

Sw + Sg = 1 . 

Darcy’s law is used to describe the fluid flow at a macroscopic level 
u„ = ~ — K(Vpa - gag), 
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where i^(x) is the absolute permeability tensor, fcra(x, S'q) the relative per- 
meability, gLa{pa) the dynamic viscosity of phase a, and g the gravitational 
vector. The relative permeability obeys the constraint 0 < kra{Sa) < 1- 
The pressure difference between two phases is expressed by the macros- 
copic capillary pressure 



Pc{x, t) = pg(x, t) - pu,(x, t) . 



Although 0 < Sw,Sg < 1, the wetting fluid can not be removed from a 
porous medium by pure displacement and the non-wetting fluid can not be 
replaced completely once it has entered the medium. If we do not take phase 
transition effects into account, a certain amount of each phase remains in the 
pore space, the residual saturation Sar- 

The effective saturations of the two phases are given by 



Sn,= 



1 - S,„r - S, 



gr 



- _ Sg Sgr 

® " 1 - S,„r - S, 



gr 



where 0 < < 1 and + Sg = 1. 

In multiphase systems, Pc is a function of the wetting phase saturation: 
Pc = Pc{Sw)- One often used capillary pressure function is the Brooks-Corey 
capillary pressure function: 

__ 

Pc{Sff) =pdSw^ ■ 



In the Brooks-Corey function the parameter pd refers to the entry pressure 
of the porous medium (the pressure required to displace the wetting phase 
from the largest pores) and A is related to the pore size distribution of the 
material. Typical values for A range from 0.2 to 3. Small A values describe 
single grain size materials while large A values indicate highly non-uniform 
materials. 

The relative permeabilities kra depend on the saturation of phase a. They 
can be evaluated by the Brooks-Corey relative permeability functions 



_ 2+3A 



^rg{Sg) = SI 
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4 Discretization 



4.1 Phase Pressure-Saturation Formulation 



For the equations in the previous section only two out of the four unknowns 
Sw, Sg, Pw, and Pg can be chosen as independent unknowns. Substituting 

Sw — 1 ^ 9 i Pg — Pw F Pc{f ^g'): 



we obtain the {pw, Sg) formulation in T x fl: 



d{<hgUl-Sg)) 

dt 

d{<^Qg{Sg)) 

dt 



V • QwQw O5 

+ ^ ' (Qg'^g) ~ QgQg ~ 



with Darcy’s velocities and Ug given by 



pw 

Ug = ~’^K{Vpg + Vp,(l - Sg) ~ Qgg). 

Pg 

Initial and boundary conditions of Neumann or Dirichlet type are 



Pw{x, 0) = Pwoi^) , Sg{x, 0) = Sgo{x) in , 

Pw{xi,t) = Pwd{y^,t) on , Sg{x,t) = Sgd{x,t) on F^d , 
Qw^w ' a — (pw(x-ft) on F.^;^ , • n — t) on Ifgn ■ 



(4.1) 



(4.2) 



In a similar way a (pn,Sg) formulation can be derived. While flux type 
boundary conditions can be specified for both phases in both formulations, 
Dirichlet type boundary conditions can only be applied for variables present 
in these equations. A further comparison of the equations reveals that the 
{Pw, Sg) formulation should be used if Sg is bounded away from 1 while the 
{Pg, Sw) formulation should be used if Sw is bounded away from 1. 



4.2 Finite Volumes for Two-Phase Flow 

To ensure the applicability of the discretization method for equations (4.1) 
and (4.2) to a wide range of applications, a fully implicit and fully coupled 
finite volume method is chosen. The method is described in detail in |5, so 
we restrict ourselves to the core ideas. 

The finite volume method requires the construction of a secondary mesh. 
In the present case of vertex centered finite volumes, the secondary mesh 
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Figure 1: Secondary mesh and notation for control volumes. 

B is constructed by connecting element barycenters with edge midpoints as 
shown in Fig. Q in 2D. In 3D, the element barycenters are first connected to 
element face barycenters and these are then connected with edge midpoints. 
Each control volume Bi belongs to a grid vertex Vi. The intersection of a 
control volume Bi with element k is denoted by {sub-control volume). 

We use the index set I = {1, . . . , A^}, with N being the number of vertices 
of the grid and the index sets lad = {i S / | ^ Fad} for vertices that are 

not part of a Dirichlet boundary. Along with these index sets, we define 
appropriate subsets of the standard conforming finite element space Vh C 

Vfi = {v & C°(n) I V is (multi-) linear on t G Eh} 
and the non-conforming test space Wh 

Wh = {w G L^{fl) I w is constant on each Bj G B}. 

For Vh and Wh, we have 

Vi, j G I V(pi G Vh ■■ = Sij and Vf, j G I Vi)i G Wh : ■ 

The spaces Vahd and Wahd used for the discretization depend on time t: 

^ahd{t') — {u G Vh I u(x^) — Sad{^i,t{ , i G J\Jad} , 

Wahd = {W GWh\ w(Xj) = 0, f G I\Iad}- 

The weak formulation of the present problem is: Find Pwh{t) G Vwhd{t) and 
Sgh{t) G Vghd{t) such that, for Wah G Wahd {a = w,g) and t G T, 



^ah {Pwh (f') , ^gh {t'} , Wah ) T ^ah {Pwh (f ) , ^gh {t{ , Wah) 
~bQ ah{l', Pwh{t) , ^ gh{t) , Wah) di 



(4.3) 
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with the accumulation terms Mah, the internal flux terms Aah, and the 
source/sink and boundary flux terms Qah being given by 

Mi ah i,Pwh (t) 5 Sgh (t\ aJ ah) — 'aJah (^z) J Si ^ dx, 

Aah {Pwh (t) ; Sgh (t) , Wah) — (^z) SdSitlQ ‘ n ds, 

Qah{ti Pwh (t) 5 Sgh (t) 5 aJah) — ^^z^/ a^ah (^z) ^ JdBinTan '^Bi ttaQa dx^ 

where in the first equation S^h is replaced by 1 Sgh- 

The precise evaluation of the quantities on the discretized domain is ex- 
plained in detail in mm The interior flux term is evaluated using an up- 
winding scheme. As an example, we show the accumulation term for the gas 
phase: 

M-jrihijPwhi Sghi‘4^i) — / ^hQnhS gh drT- 'ss ^ ^ ymeas(bj ) , 

where gn,i, and Sgy are the discrete values of the corresponding quantities 
at grid vertex i. 

5 Interface Conditions 

Media with discontinuous properties, such as fractured media, require a spe- 
cial treatment of the solution at these discontinuities. Consider a medium fl 
that consists of two subregions and , with different absolute permea- 
bilities. For an isotropic medium, we have K(x) = fc(x)I, with k = in 
and k = k^^ in . If k^ > then represents a coarse material 
compared to . Other properties, such as the porosity, may be different in 
the subregions as well. 

We assume that fl is initially fully saturated with water and that 
lies above Fig. El shows the sketch of typical Brooks-Corey capillary 
pressure functions. If a gas phase is entering the medium from below, it will 
eventually reach the interface between and , but will only enter 
if the capillary pressure is large enough. This pressure is called the threshold 
pressure or non-wetting phase entry pressure; it corresponds to pd in the 
Brooks-Corey capillary pressure functions. In Fig.Q, the saturation that has 
to be reached in order for the gas phase to enter is indicated by S^. 

If both fluids are present in both regions and we consider a point on the 
interface, we can look at this point from either side of the interface. If we 
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Figure 2: Grid at an interface and capillary pressure for discontinuous media, 
choose to look from , the wetting phase saturation is 5^ (with 5^ < 
since the fluid is already present in both regions) . Since the capillary pressure 
is an intensive variable, it is a continuous function on the whole medium 
so pUS^j) = Consequently, the extensive variable saturation is 

discontinuous at the interface. 

To incorporate this discontinuity into the (p^, Sg) formulation, note that 
no mass is lost or produced at the interface: Qw'^^w • n and PgUg ■ n are 
continuous across the interface. Furthermore, we use the extended capillary 
pressure condition uni for points on the interface. It uses the threshold satu- 
ration that can be found from pl{S^) = pi^{l) and defines in terms 
of the inverted capillary pressure function for if has been reached 
from 

0 ^ s^g<s; = i-s*^, 

" 1 1 - - s^g)), s^g >s; = i- 5 * . 

This ensures that H Sg <1 — S^, we have S'g = 0 in i.e., if the threshold 
pressure has not been reached, the non-wetting phase will not enter. Note 
that the capillary pressure is only defined if both phases exist and will only be 
defined in if the non-wetting phase has entered. See jS| for a theoretical 
and numerical examination. 

The interface conditions are incorporated into the discretization by ima- 
gining duplicated vertices on the interface as shown on the left in Fig. 0 
The grid has to resolve interfaces with element edges or faces in order for the 
interface conditions to work. 

The pressure p^ is continuous and its value is identical for both vertices. 
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To represent discontinuous values for the saturation in v( and , we define 
the vector Pcmin, with Pcmin,i = minfeg£;(i) Pcipi'^, 1 - where E{i) is the 
set of indices of all elements having vertex Vi as a corner. Using Pcmin, we 
can compute the saturation at vertex Vi with respect to element Ck- 



^n,i,k 



if ; 1 Sn,i) Pcmm,zj 

0; Pcmm,z <Pc(x^l), 

1 - S', where S solves S) = Pcmin,i- 



The evaluation of saturation with respect to Ck for any x G is done by 



Sn/i|efc(x) — ^ ^ Sji,m,kPm{N)j 

mGV(k) 

where V{k) is the set of all vertices of element k. Also, all quantities that de- 
pend on the saturation are now evaluated element-wise (dependent on Sn,i,k)- 



6 3D Air Sparging 

The first example simulates the bubbling of air in a 3D heterogeneous porous 
medium. The domain is 5 meters high, about 4 by 5 meters wide and contains 
three lenses with different sand properties. The finite volume method with 
interface conditions in the {pw,Sg) formulation is used. 

The boundary conditions are p^ = 10® [Pa] ,(/>„ = 0 on the top boundary, 
= </>u; = 0 on the sides and bottom, and 4>n = — 3 - 10 “® [kg/ (sm^)j, = 0 

at the inlet boundaries on the bottom. See Fig. 0 for the location of the inlet 
boundaries. 




Figure 3: Geometry, coarse grid and isosurface for Sg = 0.05 for 3D air sparging 
problem. Solution is shown for t = 640[s] (Visualization with GRAPE). 
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The fluid properties are = 1000 [kg/m^], Qg = pg/84149.6 [kg/m^], 
= 10“^ [Pas], and = 1.65 • 10“® [Pas]. The initial values are Pw{x, y) = 
10^ + (5 - 2 /) • 9810.0 and Sn = 0. 

The Brooks-Corey constitutive relations are used with the parameters: 



Sand 


4) 


k [m^] 


Swr 


Snr 


A 


Pd [Pa] 


Q* 

‘^71 


0 


0.40 


5.04-10-^“ 


0.10 


0.0 


2.0 


1600.0 


- 


1 


0.39 


2.05- 10-1° 


0.10 


0.0 


2.0 


1959.6 


0.30 


2 


0.39 


5.62- 10-11 


0.10 


0.0 


2.0 


2565.7 


0.55 


3 


0.41 


8.19- 10-12 


0.10 


0.0 


2.0 


4800.0 


0.80 



Figure0 shows the coarsest mesh, consisting of 1492 tetrahedral elements. 
All internal boundaries are resolved by faces of the initial mesh. The mesh 
is generated with NETGEN By uniform refinement, a grid hierarchy of 
five levels is produced, with 6,111,232 elements and 1,040,129 nodes on the 
flnest grid. 

Figure 01 shows on the right an isosurface of non- wetting phase saturation 
Sg = 0.05 at final time T = 640 [s]. The initial time step size is At = 8 [s], 
so 80 time steps are needed (unless a time step reduction is enforced by the 
nonlinear solver). Visualization has been done with the graphics program 
GRAPE H3I. 

The results show that the finite volume method with interface conditions 
works with 3D unstructured meshes and captures correctly the discontinuities 
in the saturation that can especially be seen for the uppermost sand lense 
with a relative permeability that is two orders of magnitude smaller than the 
surrounding sand. 

Thanks to parallel capabilities, the computations for this problem can 
be carried out on a Gray T3E. Performance results can be found in the 
table below for up to a million nodes (i.e., 2 million unknowns) mapped to 
128 processors. The problem size is scaled with the number of processors, 
yielding constant workload for each processor for all configurations. Starting 
with two processors and scaling by 64 results in an almost fourfold increase 
in total computation time. This can be explained by the increased number 
of Newton steps on the flnest mesh. We believe that this is due to the very 
thin layers of air under the sand lenses. 



P 


S 


^ elements 


T 






^^avg 


^mg 


2 


80 


95488 


10771 


247 


1355 


5.5 


3.44 


16 


81 


763904 


15201 


320 


1909 


6.0 


3.76 


128 


83 


6111232 


37297 


693 


4684 


6.8 


3.99 
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The multigrid method uses a point-block Gaufi-Seidel smoother and per- 
forms very well as seen from the table above where we use the following 
abbreviations: P the number of processors, S the number of time steps, T 
the total execution time, itni the total number of newton iterations, itun the 
total number of multigrid cycles, itavg the average number of multigrid cycles 
in each Newton step, and tmg the time for one multigrid cycle in seconds. 
The multigrid method scales very well with respect to the average number of 
iterations and the time per iteration (parallel efficiency). 

7 Discretization with Fractures 

In the discretization of two-phase flow in fractured media we employ two 
fundamental properties of fractures. First, due to their small width, fractures 
can be discretized by means of finite elements or finite volumes of smaller 
dimension. Second, if we do not consider open fractures, the fractures can 
be treated as a porous medium. 




Figure 4: Finite volumes for fractures. 

For the discretization we assume that interfaces between subregions and 
fractures are resolved by the grid. This situation is depicted in Fig. 0 where 
part of a fracture and the surrounding grid is shown. Since the interface 
conditions can be applied for vertices that lie on several subregions, we can 
apply the method to fractures as well. Vertex vi in Fig. 0| is part of three 
subregions: two matrix regions and the fracture. 

To illustrate the discretization process, the grid detail is shown twice in 
Fig.S on the left with just the grid as it is stored and on the right with 
duplicated vertices at the fracture. Note that the vertex Vi still exists only 
once, but in the discretization the quantities associated with it may differ 
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(dependent on the element we are viewing the vertex from). Also, shown 
is the secondary grid for the finite volume method. While the construction 
of the secondary grid for matrix elements remains the same, additional ID 
finite volumes along the fracture in TZ^ and 2D finite volumes in TZ^ have to 
be considered. In Fig. E] two ID elements with Vi as one of their nodes are 
depicted. 

The weak formulation of the problem is still as in (4.3). It is only in the 
evaluation of the terms Mah-, -^ah, and Qah that we have to take into account 
the increased number of finite volumes. 



8 Gas Infiltration into Fractured Media 

The effect of gas infiltration into a fractured medium is simulated on an 
example with five fractures. The domain size is 80m x 100m; the position of 
the fractures can be seen from Fig. El 

The fluid properties are = 1000, = 10“^, Qg = Pg/84149.6, and 

jjLg = 1.65 • 10“®. The Brooks-Corey constitutive relations are used with the 
parameters 4) = 0.4, k = 10“®, and pd = 1000 in the matrix and 4> = 0.39, 
k = 10“®, and pd = 2000 in the fracture. A = 2 and Sgr = S^r = 0 is used 
for both matrix and fracture. The fracture width is 0.04m. 

The initial values are Sg = 0 and p^ = (100 — y) -9810.0, and the boundary 
conditions are Sg = 0, (f>w = 0 on the north boundary, (j)g = 0, and hydrostatic 
pressure on the sides, and (f>g = 0, (j)^) = 0 on the south boundary except for 
the intervals [24m, 32m] to [48m, 56m], where Sg = —0.06 [kg/{sm‘^)]. 




Figure 5: Gas infiltration into fractured domain at t = 44s, 55s and 85s. 
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The problem is solved with a Newton method for the nonlinear equations 
and a multigrid-preconditioned Bi-CGstab on a grid hierarchy of five levels. 
The coarse grid has 91 elements and the finest grid has 27776 elements, which 
is created by three uniform refinement steps and one adaptive refinement step 
that only refines elements neighboring a fracture. ILU is used as a smoother 
for the multigrid method. The time step size is At = Is. 

Fig. 0shows how gas infiltrates the water saturated medium and that gas 
moves very fast inside the fractures once the gas phase has reached them. 
At the top end of the fractures the gas phase is trapped until enough gas 
is there in order for the entry pressure for the matrix to be reached; then 
the matrix is infiltrated. The parameters for matrix and fractures have been 
chosen such that these effects can be observed. In natural rocks, material 
parameter differences between matrix and fractures are often so large that 
no infiltration from the fractures into the matrix takes place. 

9 Comparison of Different Approaches 

The strong influence of the topography of fractures on the permeability and 
on the multiphase flow behavior has been realized only in the recent years. 
Recent studies of naturally fractured media uniia have shown that the de- 
scription of the relative permeability must account for the roughness of the 
fracture walls, the fracture aperture, and the contact areas. The geostatisti- 
cal model in uni assumes that both phases can only flow simultaneously if 
the fracture apertures are correlated anisotropically. A survey of other ap- 
proaches for the description of relative permeability-saturation relations in 
fractures can be found in |SI. 

In [II 3) the rough fractures are discretized as a field of parallel plates 
with different averaged apertures a,ij (see Fig. 0. The permeability of each 
parallel fracture is given by k = b'^/12. Normalizing the fracture domain to 
unit thickness, the permeability of each averaged aperture gets k = a^/12. 
The cut- off- aperture is defined by the capillary pressure Pc = (2cr cos a)/oc, 
with the surface tension a. The contact angle a is assumed to be zero. 

In the water phase is assumed to fill the void space between all plates 
with apertures lower than the cut-off-aperture Oc and the gas phase is assu- 
med to fill the spaces between all plates with apertures bigger than Qc- In 0 
this approach is referred to as the separation assumption. 

In 0 a new aperture distribution based model is formulated. It assu- 
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Figure 6: Approximation of rough fractures by parallel plates. 



mes that a fraction of the plates with dij > ac is still filled with the water 
phase. This fraction is given by the factor a. This assumption is called the 
mix assumption. Using the mix assumption, |2j reports better results for 
the relative transmissibilities and the gas saturation when the results of la- 
boratory observations of a degassing experiment m are compared with the 
predictions based on the different assumptions. 

Jarsjo developed two sets of constitutive relationships based on the sepa- 
ration assumption and on the mix assumption with a = 0.2 (see Fig. d). Both 
given relative permeability saturation functions have a very high gradient at 
S'u, ~ 1. At this point little changes of the saturation have a big influence 
on hydraulic conditions within the fracture. The factor a = 0.2 for the mix 
assumption results in a residual saturation for the water phase of S^r = 0.2. 





Figure 7: Comparison of capillary pressures (left) and relative permeabilities 
(right) based on separation assumption (dotted) and mix assumption (line). 

Based on the data in different permeability fields have been generated 
with the geostatistical tool SIMSET (Prof. Bardbssy, Institute for Hydraulic 
Engineering, Stuttgart University; best fit shown at Fig. |H|)- SIMSET uses 
the turning band method as the basic approach to generate geostatistical 
data. 

The entering of gas from the bottom of the domain is simulated based 
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Figure 8: Permeability field based on stochastically generated aperture distribution 
for an area of 1 m x 1 m. 



on the generated permeability fields. The length of the inlet is set to be 
0.02 m. The capillary pressure is computed for each plate via the Leverett 
condition for the relative permeability function 

no upscaling concept is established. 

For both assumptions the main flow paths are given by the areas of high 
permeability which are slightly connected. Neither the gas velocities shown 
differ much, nor the results for the effective permeability or the pressure fields. 
The most relevant differences occur in the saturations. The simulation using 
the constitutive relationships based on the mix assumption results in a wider 
spreading of the gas phase and in higher gradients for the saturation (see 
Fig. El). 

10 Conclusions 

Parallel multigrid on unstructured grids can be efficiently used for multi- 
phase flow problems in porous media. The MUFTE/UG program has proven 
to be successful for the simulation of realistic problems. While the current 
implementation focuses on a correct representation of the capillary pressure 
effect and fluid behavior at media discontinuities, a wide range of transport 
problems has been little explored so far. In a cooperation with the Institute 
for Scientific Computation of Texas A&M University, Department of Mathe- 
matics of Southern Methodist University, and the Exxon-Mobil Technology 
Company, we hope to expand the MUFTE/UG program to be able to handle 
transport problems. In particular, our future work will focus on the ap- 
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plication of MUFTE/UG to multiphase, multicomponent, multi-dimensional 
flows with mass interchange between phases. Toward that end, a successful 
combination of MUFTE/UG and a phase behavior package will be critical. 




Figure 9: Water saturation (top), gas pressure (middle) and effective permeability 
(bottom) for relationships based on separation assumption (left) and mix assump- 
tion (right) at t = 3.5s. 
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for Compressible Flow in Porous Media 
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Abstract 

Error estimates are derived for a finite element modified method 
of characteristics for a coupled system of partial differential eqnations 
modelling compressible flow in porous media. Some new techniques 
are introduced to conduct a convergence analysis. Optimal conver- 
gence rate is derived in the case of molecular diffusion and dispersion. 
One contribution of this paper is the demonstration of how molecular 
dispersion can be treated. 



KEYWORDS: compressible displacement, MMOC, FEM, convergence. 



1 Introduction 



The modified method of characteristics (MMOC) was first formulated for 
a scalar parabolic equation by Douglas and Russell 0 and then extended 
by Russell fo nonlinear coupled systems modelling incompressible flow in 
porous media. Some improved error estimates were derived in Conver- 
gence of finite element methods (FEM) with MMOC for compressible flow 
was given by Yuan but only molecular diffusion was considered. 

In this paper we consider the model for compressible miscible displace- 
ment in porous media, which includes both of molecular diffusion and disper- 
sion. A scheme of FEM with MMOC is established and analyzed. 

Consider the single-phase miscible displacement of one compressible fluid 
by another in a porous medium. A set of equations modelling the pressure 
p{x, t) and the concentration c{x, t) are given by Douglas and Roberts 



d(c)— -h V • u = d{c)— - V • (a(c)Vp) = q, 

Oc Ov 

(-0(0^1, P2)Vc) = {c-c)q, 



Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 69-^^ 2000. 
(c) Springer-Verlag Berlin Heidelberg 2000 
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for X S ri and t G J, where is a bounded domain in and J = (0, T], We 
assume that no flow occurs across the boundary 

u ■ v = D{c,Pi,P 2 )Vc ■ ly = 0, {x,t)GdflxJ, (12) 

p{x,0) ^ Po{x), c(x,0) = co(x), X € 

where v is the outer normal to Cj = Ci{x, t), i = 1,2, the concentration of 
ith component, c = Ci = 1 — C 2 , u Darcy’s velocity, k = k(x) the permeability 
of the rock, p the local viscosity of the mixed fluid, (j) = </>(x) the porosity 
of the rock, q the source-sink term, and c the specified concentration at 
injection wells and c = c at production wells. D is a 2 x 2 diffusion matrix, 
which reflects the molecular diffusion and dispersion: 

D = (Aj)2x2, Dij = Dij{c,Pi,P2) = Dij{x,C,pi,P2), 

where pi = dpjdxi and p 2 = dpjdx 2 - If only molecular diffusion is conside- 
red, then D = cj){x)dml, where dm is the molecular diffusivity and / is the 
2x2 unit matrix. 

Denote by z\ and Z 2 the compressibilities of the two components and set 

a(c) = k{x)/p{c), d{c) = (j){x){ziCi + Z 2 C 2 ), b{c) = 4 >{x)ciC 2 {zi — Z 2 ). 

For convenience, we assume the system above is f2-periodic. Thus the no- 
flow boundary conditions above can be dropped. Additionally, the following 
assumptions are needed in the error analysis: 

(1) There exist positive constants (/>*, </)*, a*, a*, d,, and d* such that 

(/)* < 4>{x) < 4>* , a* < a(x,c) < a* , d* < d 2 (x, c) < d* , 

for arbitrary x G fl, c G [— 1 + s'], and e' is a small positive constant. 

(2) D is positive-definite; i.e., there exists a constant D* such that 

e'^D{x,c,pi,p2)e > D*|ep, 

for arbitrary x G ft, c G [— e', 1 -I- e'], {p\,P 2 ) G B?, and e G R^. 

(3) There exist derivatives up to second order for o(c) and Dij{c,pi,p 2 ), 
which are bounded uniformly for x G fl, c G [—s' , 1 -I- e'], {pi,p 2 ) G R?. 

For a finite element analysis for compressible two-phase flow, see | 2 |. 
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2 The Procedure of FEM with MMOC 



Let Mh^ = Mh be a family of finite-dimensional subspaces of with the 

properties 

inf {|lw - -ipW + hc\\v - V’lli + hci\\v - '0IU~(n) + ||u - '>P\\w^iQ))} 

'ipGMh 

< M\\v\\sh^^, V G wan), 1<S<1+1, 

\i<Mh-^Ul i^GMh, 

||VV'||o,oo 

where M is independent of he- Similarly, we define another family of finite- 
dimensional subspaces of Nh^ = Nh, which satisfies the same pro- 

perties as Mh^ with I and he replaced by k and hp, respectively. 

Let s(x, t) be the unit vector in the characteristic direction and note that 






( 2 . 1 ) 



Multiplying (1.1) by test functions v, z £ and integrating by parts, 

we obtain the variational form 

(rf(c)^,v) -f (a(c)Vp, Vu) = (q,v), v G 

( a /| m |2 -7 {D{c,pi,P2)Vc,Vz) + {b{c)^,z) 

= ((c-c)g,z), z£H^{fl). 

Denote by At the time increment and let t” = nAt, n = 0, 1, . . . , A^, and 
N = T/At. If the approximation of p{x,t) and c{x,f) have been known at 
t"~\ iPh~", C^~") (^NhX Mh, we define 



,n— 1 



r-r 



=-a{cr^)vpr\ dtr = ^^ 



<j){x) 



At, 



r-\x) = r~\x). 



Denote the approximate characteristic direction from (x,t") to {x,C as 
t{x, t"). The numerical scheme of FEM with MMOC can be defined by 



pn pn 



At 



+ (a(Cr')VP,”, Vu) = (g",u), v G Nh, 



, CT - CT 



A) + {D{c--\p;:r\pj:,-^)vc^, vz) 



j^n — l 
~h 



,z) = {{Cl-Cl)q^,z), z£Mh, 



(2.3) 
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f r” > 0 

where Phi = dPhIdxi, Ph 2 = dPh/dx 2 , and = I ' ’ First, 

L \ u. 

Ph G Nh can be solved from (2.3); then G Mh can be obtained by (2.3). 

Before the error analysis, we define the elliptic projections p : [0, T] — >■ Nh 
and c : [0, T] — >• Mh by 

(a(c)V(p-p),Vn) +cr(p-p,'i;) = 0, v G Nh, 

(L>(c,pi,P 2 )V(c - c),Vz) + A(c - c, 0 ) = 0, z G Mh, 

where A and a are positive constants large enough to ensure the coercivity 
of the bilinear form over Let 

V = P-P, T^ = p-Ph, C = c-c, ^ = c-Ch- 



Then, 



p- Ph=TT + T], c- Ch = ^ + C- 



Lemma 2.1 Ifk, l> 1, there exists a constant M independent of h 

such that 

\\C\\+KK\\i + \\fj + hc\\f^\\i<Mhi+\ 

M + hMh + \\^\\ + hp\\^h<Mh>;+\ 
l|p||ivi,(j;Wi,(n)) + I|c||w^(j;iv,b(f2)) - 
||^||i<Mh^, <M. 

Let us select the initial approximation and C° = c°. Then, 

7r° = 0, = 0. (2.5) 

Throughout the analysis, the symbol M denotes a generic constant, not the 
same at different places, which is independent of all mesh parameters. Simi- 
larly, £ denotes a generic small positive constant. 

3 Convergence Analysis 

Firstly, we establish error equations. Subtracting (2.3) from (2.2) and using 
the definition of projection, we obtain 

{d{Cj:-^)dt7r\v) + {a{Cr^)Vn-,Wv) 

= - d{cr^)dtp-],v) - {d{cr^)d,v\v) 

-([a(c") - a(Cr')]Vp-, Vu) + a(r;", v). 



(3.1) 
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Then, let n = 1 and v = It is shown that 
{diCt)dtTr\dt7T^) + (a(C'0)V7ri, VdtTri) 

= -{{d{c^)^ - d{C'^)dtp^],dtTT^) - {d{C'^)dtv\dtn^) (-3 2 ) 

-([a(c^) - a{C‘jl)]V{p^ - p'^),Vdtir^) 

-([a(c^) - o(C')])]Vp\ VdtTT^) + cr(? 7 \dt 7 r^). 

By = 7 T° = 0 and inverse properties, we have 
||dt7?i|p < 

i[a{C) - a{C^^)]Vp\Vdt7r^) < M\\C - ~c°\\ \\Vp^\\ \\Vdt7T^L^ 

< M\\c^ - c°\\h’^h~'^\\dt'!T'^\\ < M{At^ + ||C°|P) + £||(it7r^|P, k>2. 

Using integration by parts, we obtain 
\{[a{C)-a{Cl)]Vp\Vdtr:^)\ 

< |([a(ci) - a{C^)]Ap^,dtTT^)\ + |V([a(ci) - a(CjJ)] • Vp\dtn^)\ 

<M{At^ + \\CT + mT) + 4dt7rH^- 

By the boundedness of d and a, we have 

\\dt44 + ^ At\\Vdt44 < M{A4 + hf + hf+^). (3.3) 

If we try to derive a reasonable evolutionary error inequality for p(x, t) 
from (3.1) which can be coupled suitably with that for c{x,t) given later, 
some essential difficulties would occur unavoidably. Differently from the case 
of molecular diffusion, we now establish another error equation from a new 
viewpoint. Operating (3.1) by dt and taking v = dt 7 r”, we obtain the error 
equation 

{dt[d{C;i-^)dtTT%dtTr'-) + {dt[a{C^-^)VTr%VdtTT^) 
f)T)^ 

= -(dt[d(c”)^ - d(c”-i)dtp”],dt^”) 

-(dt[(d(c"-i) - d(cr'))c?tp"], rftTT") (3.4) 

-{dt[d{CJ^-^)dtri%dt7r^) + a{dtV^,dt7r^) 
-(d4(a(c")-a(Cr'))Vp"],Vdt7T"), 

where n > 2. Denote the terms on the left side as Ai and A 2 , and the terms 
on the right side as A 3 , , A'j in order. 

The induction hypotheses are useful 

||dt7r’’||Loo(n) < 1, ||V7r’’||icx=(n) < 1, ||C’'llivT(f*) - (3.5) 
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Let At = o{hp), hi. = o{hp), and A: > 1. From (3.3) and the inverse properties 

||'it7r^||L=o(n) < Mhp^WdtTr'^W < Mh~^{At + h[ + hp~'~'^) < 1, 
l|V7ri|Uoo(n) <M/i-i||V7ri|| <M/r;i(At + /r' +/i^+i) < 1. 

Then from (2.5) and the above inequalities, the first equation in (3.5) is 
true for r = 1, the second is true for r = 0, 1, and the third is true for 
r = 0. Now, we suppose that the first and second equations in (3.5) are 
true for r = 1, 2, . . . , to — 1 and the third is true for r = 0, 1, . . . , to — 1. We 
demonstrate that (3.5) is true for r = m. 

By expansion, we know 

dli> dt{d{CJl-^)dtTT^,dtTrn + (dtd(Cr')rft^”-\rft^”)/2 

-(dtd(Cr^)rfi7r"-\ dt7T"-i)/2 = + A? + A?. 

By the induction hypotheses and the linearity of d{c) on c, we have 

|A? + < M{hf+^ + \\dtC~^f + WdtTT^-^f + |Mt7T"f), 

where J"“^ = For A 2 , A^, A 4 , and Aq, we have 

A 2 = (a(Cr^)dtV7T", VdtTT") + (dta(cr^)v^"-i, VdtTT") = + Al 

A 2 > a*||V(it7r”|p, 

|7li| < M(/i2'+2 + \\dte~T + l|V7r"-i||2) + £||Vdt7r"||2, 

1^3 + ^4 + Al < M(Af2 + + hf+^ + + \\dte~T + WdtTT^^)- 

Now, only A 5 and A^ are left. Divide A^ into two parts 

A = -{dtd{Cr^)dtV",dt7Tn - (d(Cr")rf?A, =Al + 712. 

By inverse properties, 

\Al\ < M{\\dte-^\\ IMtAll l|dt7T"|Uoo(a) 

+ ||dtC"”^ll IMtAll l|c^t7r"||Loo(n) + ||dt77"|| ||dt7r”||). 

By the mean value theorem and the boundedness of d^rj/dt^ (Lemma 2.1), 
we have 

712 ^ + t) - |2(r-2 + t)]dh dtTT^ 

= -(d(C'r")At-i /o^‘[S?(t”-" + t) + Atg?(t"-2 + 0 At + t)]dt, dtTT"), 
lAl^ M{At ^||-^^|li2(jn-i.4^2(Q)) + Af2 + ||dt7r”||2). 
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We can divide Ay into three parts 

Ay = -(dtla(c^) - o(c”-i)]Vp”, VdtTT”) 

-K[a(c"-i) - a(Cr')]Vp", 

-([o(c”) - a(Cr^)] Vdtp", VdtTT") = + Af + A^, 

14 + 4I< M(At^ + + lie’^lP) + s\\Vdtn^f. 

There exist c* G and C^ G 

f)n F)n 

dt[a(c"-i) - a{Cl~^)] = - -(C^)d*Cr\ 

|c* - C*| < |c”-i - c”-2| + |c”-i - C'”-1| + |c"-2 - C^-2|. 

Then, 

|4| < M(At2 + /if+2 ^ ||^n-l||2 + ||^"-2||2 + ||d,^"-l|l2) + e\\VdtTT^f. 

Multiplying (3.4) by At and summing that from n = 2 to n = m, if £ is taken 
to be small enough, by the above estimates and (3.3), we obtain 

m 

WdtTT^f + E At < M(At2 + hf + hf 

n=l W-Oj 

E:=i(iirf + + \\d,er + \\dt7T-r)At), 

for TO > 1. This is the evolutionary error inequality for p(x,t)- 

Now, we turn to the concentration equation. Subtracting (2.3) from (2.2) 
and taking 2 = we see that 

(Me, dtC) + (i?(cr\ p,Ei)ve, vd*e) 

a n n _ pn-1 

= -([</>— + . vc" - ct> ],d,n 

-([a(Cri)VP^"-i - a(c")Vp"] • Vc", dtC) 

An An— 1 An— 1 An— 1 

AU—l An— 1 

-{<!>- ^ ,dtC) + HC + C”, dtC) - iiC + C)q+, dtC) 

-(4(c",p?,p^) - i?(C'r\P4\P^"2-')]Vc",M”) 

-{b{c-)^ - b{ci~^)dtPz.dtn, 

where q+ = max(0,g). Multiplying this equation by At and summing over 
1 < n < TO, we denote the resulting terms as Bi, B 2 , ■ ■ ■ , Bu from the left to 
the right side. Obviously, 

m 

By ><!>.'£ WdtCW^- 

n—1 
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B 2 can be decomposed as follows: 

m 

b 2> i e ve) 

n—l 

m 

-i E VC”-1) = Bl + Bl 



By = 0 and 



Bl = ^p(c^-^p.r^^.r')vr,vr) > ^^*iivrf , 



we see that 



E WdtCf^t + ^D4vrf < S3 + . . . + Sii - Bl 

n—l 

By estimating negative norms and integral transformations [2], we have 
\B3 + Bq + B'i'l < 

m m 

+M E (II vrf + II VC" ip)At + £ E Her At, 

n—l n—l 



where r is the approximate characteristic direction from (C,x) to (t" ^,x). 
A direct computation leads to 

\B^ + B5 + Bs+Bq + Bii\< M{ Af2 + /i2i+2 ^ j^2k 
m m 

+ E (iic"f + htth + II )At} + £ E Her At. 

n—l n—l 

The crucial is to analyze and Big. We find that 

\dtDece\PH\pHr\ 

< M{\dte~"\ + MtC"-^i + Mtc"-i| ^ i^dteH + ivdt^"-i| + |vdtp"-i|). 
By the induction hypotheses, we have 

m— 1 m 

\Bl < MHJ+H hf + Y. ||VrfAt) + £^(|M,C”f + l|Vdt7T"f)At. 

n—l n—l 

To handle i?io, by summation by parts and ^ "''^6 obtain 

m 

Bio = E {dt[D{c^,p^,p^) - 7^(c"-i,pr\pri)]Vc", VC"-i)At 

n—l 

m 

+ j2{dt[D{c-\pe\prr - D{ce\p;:i-\pHrHc-,ve-rAt 

n—l 

m 

+ E {[DH,PiP^) - D{ce\pj:r\PHr]dt^c^,^e-rAt 

n—l 

-i[D{c^,pT.PT) - 7?(cr-\p^T'>^r2”')]v£'", vr) 

= -^10 + ^10 + ■ • ■ + Bfo- 
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It is easy to show that 

m 

\B\^ + Bl^\ < M{Ae + + hf+J2 (lir f + llVrf )At}, 

n—1 

\Bfo\ < M{At^ + hf+^ + hf + 

For Biq, using the mean value theorem and smoothness of Dij, we have 

d,[D,,{c-\pr\pr^) - 

< M{At+\dt^-^\ + \dtC~^\ + ic”-i| + ic”-^i + ie”^i + ic"”^i 

+ |Vdt7T"-i| + |Vdt?7”-i| + |V7 t"-1| + |V7t”- 2| + |Vr;”-i| + |Vr/”-2|). 
Then, 

m 

\Bl,\ < M{Ae + hf + + E (im? + l|V7T"|p)At} 

n—1 

m 

+e Y.{\\dtfrr +W^dtTT'-r)At. 

n—1 

Combining the above estimates and taking e small enough, we have 

m 

E At + ||ve™f < M{Af2 + hf + hl^ + + ||V^™-1||2 

n—1 

m m 

+ E (liril? + II )At} + £ E II At. 

n—1 n—1 

Multiplying (3.6) by (M + 1), adding the result to the above inequality, and 
taking s small enough, we obtain 

m 

E (IMtCf + ||Vd*^"|p)At + II vr f + l|rfcr-||2 

n—1 

< M{At2 + hf + hf + + ||V7r™-i||2 

m 

+ E(ll?”ll? + liv^”f + IMt^”f)At}. 

n—1 

Observing that 

m m 

lirf - ||/°f < lirf At + e^ ||dj"f At, 

n=l n—1 

we now have 



E (lldtrf + l|d*7r"||?)At + lir II? + WdtTT^f + Ik™||? 

n—1 

m 

< M{Ae + hf + hf+Y. (iirii? + iki? + )At}. 

n—1 
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Using Bellman’s inequality, it is shown that 

max ||(it7r™|p + max ||7r™||i + max ||^™||i 



Kn<m 



l<n<m 
^n\\2 I II cn\\2\ 



Kn<m 



+ E {\\dtnm + )At < M{Af + hf + Kf). 



(3.7) 



Let the parameters satisfy that 

At = o{hp), At = o{hc), k>2, I >2, = o{hp), hp = o{hc), 

and let At and h be small enough that 



(3.8) 



Mh~\At + hl + hi) < 1, Mh~\At + hi + hi) < 1. 



Now, we demonstrate the induction hypotheses (3.5) is true for r = m. 
By (3.7), (3.8), and inverse properties, we find that 

IMt^'”||L~(n) < Mhp^WdtTT^W < Mhp\At + h'; + hi) < 1, 

||V7r™|U=o(n) < M/i;i||V7r’"|| < Mh^\At + h^, + hi) < 1, 

Iiriki,(n) < K^WrWi < Mh-\At + h'; + hi) < 1, 

so (3.5) is true for r = m. 



Theorem 3.1 Suppose that the assumptions given in §1 hold and the space 
and time discretizations satisfy (3.8). Then there exists a constant M such 
that, for {hc,hp) sufficiently small, 



max ||c" 

0<n<T/At 

T/At 

+ E 



cni 



max 

0<n<T/At 

TjAt 



Ib”- 



Pf)\\lAt+ E llb(c' 



-P^lli + \\dt{p^-Pf)\\^ 

l<n<i /At 

- C^)f At < M{Af + hf + hf), 



n—1 



n—1 



where M is independent of At, he, and hp. 



Remark. In addition to the optimal estimates in the L°°{J; H^{id)) 
norm, we also obtain an estimate of dt(p" ~ the L°°{J; L'^{Tl)) norm 

and in the L^{J;H^{fl)) norm, which cannot be found previously. This 
should attribute to the special technique for establishing error equation (3.4). 
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Abstract 

In this paper we consider an integrated model for single-phase fluid 
flow in elastic porous media. The model and mathematical formula- 
tion consist of mass and momentum balance equations for both fluid 
and porous media. We propose a mixed finite element scheme to solve 
simultaneously for the porous media displacement, fluid mass flux, and 
pore pressure. A prototype simulator for solving the integrated pro- 
blem has been built based on a finite element object library that we 
have developed. We will present numerical and sensitivity results for 
the solution algorithm. 



KEYWORDS: geomechanics, fluid flow, elastic deformation, porous media 

1 Introduction 

Mechanical processes in porous media involve two basic elements: fluid flow 
and rock deformation. It is important to address the coupling of fluid flow 
and rock deformation for stress-sensitive reservoirs in reservoir simulation. 
In stress-sensitive reservoirs, the change in fluid pressure, which is caused 
by fluid injection or production, perturbs the stress state of the rock. The 
change in stress state triggers the rock deformation which, in turn, affects 
the fluid flow processes by changing the volumetric behavior of the reser- 
voir fluid and rock properties. In most conventional reservoir simulations, 
one only considers the impact of rock deformation on fluid flow through the 
pore compressibility which is expressed as a function of the fluid pressure. 
This simplistic approach is not adequate for simulating the coupling effects 
in stress-sensitive reservoirs. Moreover, reservoir simulations which are based 
on this simplified model can lead to wrong predictions for reservoir perfor- 
mance. Consequently, it is necessary to develop a coupled model where the 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 80-^^ 2000. 
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dependence of flow and deformation on each other can be modeled simulta- 
neously. 

The key issue in a coupled model is how to describe the interaction bet- 
ween the flow and deformation. The concept of “effective stress”, which is 
defined as the total stress of rock minus the fluid pressure, was first propo- 
sed by Terzaghi to take into account the effect of fluid pressure on the rock 
deformation m- Then this concept was generalized for the coupled model 
of fluid flow and rock deformation by Biot in a number of papers PI2I01I1I, 
into what is called “poro-elasticity theory”. The poro-elasticity theory has 
been widely used in civil, mining, petroleum, and environmental engineering 
for several decades. While considering the impact of the rock deformation on 
the processes of fluid flow, the rock properties, such as porosity and permea- 
bility, may change significantly with the variation of both fluid pressure and 
the stress state PEE]. The constitutive relations which describe the depen- 
dency of the rock properties on the deformation are derived based on various 
experimental observations and physical assumptions PPEIEIIEIII^. Pri- 
marily, the effect on the volumetric behavior of fluid flow is widely used in 
developing coupled models, in which the pore compressibility depends on not 
only fluid pressure but also the mean stress of the rock PODj. 

Generally, the mathematical models for solving the coupled problems 
are derived from physical principles based on certain physical assumptions 
p El ma DU cia cni The mathematical formulations are composed of a 
system of mixed-type nonlinear partial differential equations because of the 
coupling terms in the equations which govern flow and deformation. Nume- 
rical solution algorithms for solving such a system are difficult because the 
algorithms need to address the mathematical characteristics of both flow and 
deformation problems and, at the same time, to handle the coupling terms 
properly. Standard finite element methods have been used to solve the cou- 
pled system ElOj- However, the application of standard Galerkin 

methods in solving fluid flow problems raises concerns because of the loss 
of local mass conservation which is an important issue in reservoir simula- 
tion. Osorio used a finite difference method to solve the same problem in a 
sequential fashion, which preserved local mass conservation EH. However, 
the simultaneous solution procedure is more favorable compared with the 
sequential solution procedure due to computational efficiency. 

In this paper we employ the mixed finite element method to discretize 
the governing equations in which the Galerkin method is used for the defor- 
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mation equation and the lowest order Raviart-Thomas element is used for 
the fluid flow equations to conserve mass locally |0| • The numerical problems 
are solved in a fully coupled and fully implicit fashion, which requires an 
efficient iterative solver. The research simulator has been built up based on 
the finite element objects which have been developed in an object-oriented 
fashion The simulator has solved several test problems and field-scale 
simulation is ongoing. The primary numerical results show the strength of 
the solution procedure. The rest of the paper is organized as follows: In §2, 
we derive a mathematical formulation for fluid flow in elastic porous media 
from basic physical principles. In §3, we propose numerical schemes to solve 
mathematical problem. In §4, we briefly discuss our simulator development 
strategy. In §5, we show numerical results for solving an axis-symmetric pro- 
blem by using the simulator. In §6, we draw some conclusions and identify 
the directions for future work. 



2 Mathematical Model 

In this paper, we make the following basic assumptions: (1) temperature of 
the reservoir is constant, (2) there is no mass exchange between the rock 
phase and fluid phase, and (3) the rock is elastic material. The governing 
equations of coupled rock deformation and fluid flow are derived from the 
following physical principles: mass conservation and momentum balance for 
both fluid and rock phases. Since there is no mass exchange between the rock 
and fluid, the mass is conserved in terms of phases: 

+ ^- (pf^Lf) =qj, ( 2 . 1 ) 

and 

+ V • {pr{l - = 0. (2.2) 

Equations (2.1) and (2.2) are the mass conservation for fluid and solid phase, 
respectively. The subscripts / stands for fluid phase and r for rock phase. 
In equations (2.1) and (2.2), </) stands for the porosity, p stands for the mass 
density and qj stands for the flow rate of a source/sink term. The momentum 
balance for fluid is interpreted as Darcy’s law in terms of fluid velocity relative 
to the rock velocity: 



v = (l){vf-v^) = - 



k 

^ (Vp-kp/gVD) 



(2.3) 
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where /r stands for fluid viscosity, k for the rock permeability which is a tensor 
and g denotes the gravitational acceleration. The momentum balance for the 
solid phase is expressed by the steady state force balance: 



V-^=g„, (2.4) 

where a stands for the stress state of the solid phase and is a body force. 

According to the poroelastic theory the effective stress is used in 

describing the stress-strain relationship: 

g' = 2Gg+XV-u, (2.5) 

where a' stands for the effective stress, u for the displacement, e for the 
strain, G for shear moduli and A for the Lame constant. The elastic strain- 
displacement relationship is defined as 

£= (Vu-H Vu^) /2. (2.6) 

The effective stress a’ , is defined as follows according to PIIS|: 

= — apJ, (2-7) 



where p is fluid pressure, a is a coefficient which is positive but less than 1 and 
S is Kronecker S. Generally, a could be a function of the fluid pressure and 
rock deformation. However, it is reasonable to assume that a is a constant for 
single phase fluid flow in elastic porous media. Combining equations (2.4)- 
(2.7), we obtain the governing equation for the rock deformation in terms of 
rock displacement u and fluid pressure p: 



V • [G(Vm -k Vu^)] -k V(AV • u) + y{ap) = q^. (2.8) 



In this paper, the effect of the rock deformation on the flow processes are 
treated through: the dependency of the porosity on both fluid pressure and 
mean stress, and Darcy’s law is valid for the fluid velocity relative to the solid 
velocity. Substituting equation (2.3) into (2.1), we obtain the following form 
for the fluid mass balance equation: 

-kp/d>V -k V • (p/u) = g/, (2.9) 

where the total derivative is defined as d/dt = ddt -k • V. From equation 
(2.2), we obtain the following relationship: 



V • 



Vb dt ’ 



( 2 . 10 ) 



84 



Chen, Ewing, Lyons, Qin, Sun, and Yale 



where Vt stands for the bulk volume of the rock phase. Since the porosity is 
defined as the ratio of pore volume Vp to the bulk volume Vb, we obtain the 
following relationship: 



# _ _ 1 dVb\ 

dt dt \Vb ) \Vp dt Vb dt ) ' 



( 2 . 11 ) 



By substituting equations (2.10) and (2.11) into equation (2.9), we obtain 



PfCf) 



( ^ 

\Pf dt 



Vp dt ) 



+ V - {pfv) =qf. 



( 2 . 12 ) 



Generally the fluid density is a function of fluid pressure and the compressi- 
bility of the fluid is defined as c/ = ^ . The change of the pore volume 

Vp is caused by both fluid pressure and mean stress of the rock phase when 
considering the coupling effects. Therefore, we obtain the following relati- 
onship: 



1 


dVp 


= — 1 


fdVp^ 


] - + 


1 


fdVp^ 






dt 




\ d<J y 






\dpy 


Kdt’ 



where the mean stress a is defined as 



(2.13) 



a = tr{(j)/3. 



(2.14) 



Then we define the compressibilities as follows: 



1 

Vp 




and 



1 

^pp ~ 

Vp 



\ dp ) 



G 



(2.15) 



In equation (2.15), Cp„ and Cpp are compressibilities of the pore volume. The 
first letter in subscripts denotes the type of volume (e.g. “p” for pore vo- 
lume or for bulk volume) and the second letter refers to the independent 
variable (e.g. “p” for fluid pressure and “cr” for the mean stress of the rock 
phase). We can relate these compressibilities to the experimentally mea- 
surable compressibilities: the compressibility of the solid grains and the 
compressibility of the bulk volume with respect to mean stress Cba m-- 



Cpcr — 




and 



^pp — 



0 



(2.16) 



Combining equations (2.13) and (2.16) with equation (2.12), we obtain 
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where c* and c\ are given as follows: 



Ct = Pf {Cba - (1 + (fycr + 4>Cf) , (2-18) 



and 

Cl = p^ {c-ba ^r) • (2-19) 

We can obtain a relationship between volumetric strain and mean stress from 
equations (2.7) and (2.5): 

V • u 

a = = + ap. (2.20) 

Cba 

Because we are only considering elastic (small) deformation, it is reasonable 
to neglect the higher order term in the total derivative then ^ = ■§[■ 
Combining equations (2.9), (2.18), (2.19), and (2.20), we obtain the governing 
equation for fluid flow as follows: 






(YdM) 

V Cbc ) 



d{ap) 



f dp 



(2.21) 






where if = pfk/ p,f. 

Equations (2.8) and (2.21) form the system of governing equations for 
single phase fluid flow through elastic rock. The primary variables are the 
displacement u, mass flux of the fluid / and fluid pressure p. 

To determine the solution of equations (2.8) and (2.21), we define the 
following boundary and initial conditions: 



U = Ua 

G{Vu + Vu^)n + AV ■ un = 
P = Pd 

K{'^P + Pfg^D) n = 

P = Po 

U = Uq 



on Trd, 
on 

on Tfd, 
on T/„, 
in flo at f = Oj 
in 12o at t = 0, 



where stands for the solution domain, dfl stands for the boundary of the 
solution domain. Ti^; U Ti„ = T 2 d U T 2 „ = dfl. The subscripts r and / 
stand for the boundary condition in terms of the rock deformation and fluid 
flow, respectively. The subscripts d and n denote the Dirichlet and Neumann 
boundary conditions, respectively. 
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3 Numerical Solution Algorithm 



To solve the system (2.8) and (2.21) numerically by using the mixed finite 
element method, we need to write the system in a weak formulation. To this 
end, we shall use the following standard notation for Sobolev spaces and their 
norms: 

U = {uG I M = on r Id}, U = {uG | u = 0 on T id}, 

V = {/ e L2(fl)3 I V • / e L2(fl), / • n = /„ on Taj, 

V = {/ e I V • / e L^n), / • n = o on T 2 „}, 

w = L^{n). 

Now, the weak formulation for the coupled system (2.8) and (2.21) seeks 
(Mj /)P) S W X V X W satisfying 



a(w, v) + {ap, V • u) - {ap, v ■ ll)ri„nr2„ = + (m„, w)ri„ > 

'f) - (P, V • ^) = -{Pfg^D, 'f) - {pd, ^ , 




(3.1) 

for all {v,ip, k) G U X V X W. 

Let h G (0, 1) and 7)i be a family of regular triangulations of Lt. Let Uh G 
U be the piecewise continuous linear finite element spaces and (V/i,W/i) G 
(V, W) the lowest order Raviart-Thomas mixed finite element spaces m 
The finite element approximation {uf^, f^,Ph) G Uh xVh x Wh is the solution 
of the discretized version of (3.1): Find {uh,f^,Ph) G Uh xVh x Wh such 
that for all {vh,^^^, Kh) GUhxVhxWh 



a{uh,Lh) + (oiPh, V • Ud) - {aph,v^ ■ n)ri„nr2„ 



(9„,Ld) + {Uu:Lh)r,„ > 







= -(9/,K/l)- 



= - (pd,±^-r^^ , 



(3.2) 

A backward Euler scheme is employed for the time derivatives in (3.2) and 
a Newton iteration is used at each time step due to the nonlinearities in the 
governing equations. Here we present the simultaneous solution algorithm 
for solving the system as follows: 



1. At time t = 0, the increment in displacement Am is given as 0 and the 
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fluid pressure is known from the initial conditions for the model. The 
compressibilities are given as constants in this paper. 

2. Update time to t = t + At. 

3. Compute the porosity, fluid density and viscosity based on pressure and 
temperature. 

4. Apply Newton’s method for system (3.2) to generate Jacobian matrix. 

5. Solve the linear system iteratively and then update the primary varia- 
bles. 

6. Repeat steps 3-5 until solution converges. 

7. Go back to step 2 to update the porosity and the fluid properties and 
repeat the above process until time t = T. 

4 Simulator Development 

In the development of numerical simulation technologies, a flexible numerical 
tool is necessary to quickly test proposed numerical methods and algorithms 
for solving mathematical problems describing different physical problems. 
Therefore, the tool should be flexible in the choice of problems and finite 
element methods and its software components should be reusable. Moreover, 
the tool should also be simple to interface with other software packages. 
Based on the above considerations, we divided the simulator development into 
two steps: (1) develop object-oriented finite element objects (FEO) which can 
be utilized to solve various model problems by using different finite element 
schemes, and (2) build a simulator for solving specific physical problem, such 
as the coupled problem, by using these finite element objects. 

In developing FEO, we focused on the implementation of the finite ele- 
ment formulation and interfaces to the mesh generator and linear solver, in 
addition to the geometry and partial differential equations [E|. There are 
two basic components in implementing the finite element method: (1) varia- 
tional formulations which consist of a set of linear and bilinear forms; and 
(2) shape functions which generate the approximate solution. Consequently, 
we identified the objects as the bilinear forms, linear forms, piecewise poly- 
nomial spaces, boundary integral forms and essential boundary conditions. 
Object-oriented programming techniques, especially polymorphism, are used 
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to uniformly process these different objects. A user interface specifies these 
objects and combines them to form a finite element problem. 

Based on FEO, we need three steps to construct the simulator for solving 
the coupled problem. First, we need to develop a class which implements the 
constitutive relationships for updating the fluid and rock properties. Then 
we form a finite element problem for the coupled problem by using FEO. 
Finally, we need to develop a driver which implements the Newton iteration. 



5 Computational Examples 



A simulator for solving the coupled fluid flow and rock deformation problems 
has been developed. The simulator has been tested with various problems 
in order to debug and to improve the overall efficiency. In this paper, we 
show the numerical results for solving an axis-symmetric problem with known 
analytical solution. It is of practical importance to solve the axis-symmetric 
problem because the axis-symmetric problem is of great interest to study the 
single well injection or production case where the coupling effect can have a 
large impact on reservoir performance. Moreover, it is numerically efficient to 
use a 2-dimensional simulator to simulate 3-dimensional reservoir behavior. 
In the following test problem, we check the transformation which transforms 
the 3-dimensional problem into an axis-symmetric problem and to check the 
convergence rate of the proposed numerical algorithm. 

By assuming that all coefficients and the solution are axis-symmetric, 
i.e., -§g = 0 and ug = 0, we can reformulate the system (2.8) and (2.21) in 
cylindrical coordinates. First, we have the following transformation formulas: 

V-M= i + ifwe = 0, 

+ If e, = Vr.i; if W = »> 

where = (cos 0, sin 6*, 0), = (0, 0, 1). Thus we obtain 



V • [G{Vu + Vm"^)] = i 
(rG(V. 



~^{2rG^) + -^{rG^ 
ar^ or ^ az^ oz 



rG^)-^Gur 



= iy 

f ^ rz 



^))- 



-^GUrg , 
r 



V(AV -m) = 



9r 



-k 



d 



A f d(rur) 
dr 

A f d{rur) , 



d{ru„) 

d{ru„y 

dz 
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Multiplying system (2.8) and (2.21) by r, we have 



~^rz ■ (?’G(VrzU+ VrzU^)) + 

-rVr-z • (ru)^ - rVrzi.ap) = rq^, 

r/ + rK (VrzP + pfgVrzD) = 0, 






'^rz • (ru) 



^bc 



+ Cir 



d{ap) 

dt 



- 



• (^/) - 



-rgy. 



(5.1) 



Since the axis-symmetric problem defined in 2-dimensional rz coordinates 
is the transformation of the 3-dimensional problem in xyz Cartesian coordi- 
nates, there is no boundary conditions specified on r = 0. 

We specify the solutions for u, f, and p as follows: 



u = te * (a;(x^ -I- y'^)e^,y{x^ + y‘^)e^ , {x^ + , 

p = (x^ -I- ?/^)^e^sint, (5.2) 

/ = — {Ax{x“^ + y^),4p(x^ -I- y^), (x^ -I- y^)^)^ sint. 



Assuming all coefficients of system (2.8) and (2.21) are constants, (u,f,p) is 
the solution of the system with the following right-hand sides: 



In 



= -G 

( 

-A 



16x6^ -I- x(x^ 
16ye^ -I- yixP' 
y A{x^ + y^)e^ + 16(a:" 



8xe^ 

8ye^ 



8a:(a 

8y(a 




V (a: +y )" 



sint. 



y 4(x^ + y^)e^ + 2(a:^ + y^)^ 

-y/= ^ [4(x^ + y'^)e^ + 2(x^ -I- y^)^z] (1 - t)e~ 

+(cio — C 2 ){x^ + y'^Ye^ cost + [16 + (x^ + y^)] (x^ + y^)e^ sinf. 



The initial conditions for this analytical solution are: 



Mq = 0 and po = 0. 

The analytical solution (5.2) is transformed to the axis-symmetric case: 



u = 



r^e^te * 
r'^z'^te* 



and 



p=r^e^sinf and / = 



— 4r^e^ sin t 

_j,4gZ ginf 



(5.3) 

(5.4) 
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Substituting (5.3) and (5.4) into (5.1), we obtain the following right-hand 
sides and qf. 



q =-G 

—U 

-A ( 



16re^ -I- r^(e^ -I- 8 z) 
-I- 16r^z^ -I- 4r^‘ 
8 re^ -I- 8r^z 

4.^2 gZ _|_ 2 j ,4 



te~^ 

4^3 



te* — a \ ^ I sint, 



—qf= [4r^e^ -|- 2r^z] (1 — t)e * -I- {c\a — C 2 )r^e^ cos t 
+ [I 6 -I- r^] r^e^sint. 

The resulting stress boundary condition is of Neumann type: 



[rG{V rzU + ^rzi^) + AVj .2 • (ru)] • n = 

0g3gZ j,4gZ _|_ 4 j ,4^2 

+ 4r^z^ 4r^z 



Gte- 



^ • Z1 + A(4r^e^ -I- 2r^z)te *n. 



We have solved problem (5.1) by using the simultaneous solution algo- 
rithm with both triangular and quadrilateral elements. The L 2 average trun- 
cation error is calculated with respect to the analytical solution (5.3) and 
(5.4). We carried out four runs by refining the mesh size by a factor of 2 
and refining the time step size by a factor of 4. Table 1 lists the L 2 average 
truncation errors for each primary variable in four runs using quadrilate- 
ral elements. It is clear that the proposed numerical algorithm is 2nd-order 
accurate. 



6 Summary and Future Work 

A mixed finite element method has been employed to solve single phase fluid 
flow through elastic porous media. A simultaneous solution algorithm has 
been implemented using FEO to solve this coupled problem in a fully coupled 
and fully implicit fashion. The simulator has been tested with various test 
problems and the numerical results presented in this paper show the overall 
numerical accuracy by the proposed algorithm is second order. The iterative 
solution procedure which can handle the resulting linear system from the 
coupled problem needs to be further studied and developed in order to carry 
out field-scale reservoir simulations. 
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Ur 


Uz 


P 


f -n 


2x2 


0.098 


0.155 


2.556 


0.218 


4x4 


0.023 


0.036 


0.667 


0.059 


16 X 16 


0.005 


0.008 


0.126 


0.015 


32 X 32 


0.001 


0.002 


0.038 


0.004 



Table 1. Average truncation error. 
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On the Discretization of Interface Problems 
with Perfect and Imperfect Contact 

Tatiana Chernogorova Richard E. Ewing 
Oleg Iliev Raytcho Lazarov 



Abstract 

A second-order difference scheme for a first-order elliptic system 
with discontinuous coefficients is derived and studied. This approxi- 
mation can be viewed as an improvement of the well-known scheme 
with harmonic averaging of the coefficients for a second order elliptic 
equation, which is first-order accurate for the gradient of the solution. 

The numerical experiments confirm the second order convergence for 
the scaled gradient, and demonstrate the advantages of the new discre- 
tization, compared with the older ones. 

KEYWORDS: interface problems, second order discretization 

1 Introduction 

The single-phase fluid flow in a fully-saturated inhomogeneous porous media 
that occupies a bounded domain O C i?", n = 1,2,3 is often modeled by 
first-order system 

V-u=/(a;), u=—K'Vp for a; G O, (1.1) 

subject to various boundary conditions. Here u is the Darcy velocity, p is 
the pressure, and K(x) is the permeability tensor. The first equation is 
the continuity equation, while the second one expresses the relation between 
the pressure gradient and the velocity by the linear Darcy law. We assume 
that K(x) is a diagonal matrix with positive elements which may have jump 
discontinuities across given surfaces T called interfaces. In this paper we 
consider two types of conditions on the interface T: (a) a perfect contact: 

[p] = 0, [u • n] = 0 for ^ G T; (1.2) 

and (b) imperfect contact: 

(u • n)+ = (u • n)_ = a{(p+ — p_) for ^ G T. (1.3) 
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Here n is the normal to the interface F unit vector (with fixed direction), 
9±{0 denotes the right and left limits of the function g at point and 
[ff] = g+-9-- 

Often the velocity is eliminated so that the system II I . I |l is reduced to 
a second-order elliptic equation for the pressure. A modified finite volume 
discretization for such a class of elliptic problems has been introduced and 
studied in 00] and numerical experiments, confirming second-order con- 
vergence for the pressure, have been presented. In the present paper we 
continue the research in 00 ] by introducing a new approximation of the 
velocity and studying its properties. We also study the relative accuracy for 
the approximate pressure near the interface, and discuss its superconvergent 
behavior in a particular case. We restrict our consideration to a class of inter- 
face problems for which: (1) each interface is parallel to a co-ordinate axes, 
and (2) the velocity is continuously differentiable in the normal direction to 
the interfaces. Under these two assumptions, we construct a second-order 
approximation of the system dnD on a staggered grid so that the pressure 
values are computed at the centers of the control volumes, while the values 
of the normal component of the velocity are computed at faces of the cells. 
The numerical experiments show that the derived scheme has second order 
convergence for both, the pressure and the velocity, and is superior to the 
schemes with harmonic averaging, which exhibit second-order convergence 
for the pressure and first order convergence for the velocity. The computati- 
ons also show that for a smooth velocity the proposed discretization gives a 
considerably more accurate pressure, compared to the results of the scheme 
with harmonic averaging of the coefficients. 

2 Discretization of the Interface Problem 

Here we present in detail the discretization of the one-dimensional (1-D) pro- 
blems with perfect and imperfect contacts. The discretization of the multi- 
dimensional problem on a tensor-product grid is just a tensor product of 1-D 
discretizations. 

2.1 Discretization of the Perfect Contact Problem 

In order to illustrate our approach, we first consider the perfect contact 
problem (ECU) - (O) in the one-dimensional case. We introduce a stan- 
dard uniform cell-centered grid, Xq = 0,xi = hf2,Xi = Xi-i + h, i = 
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2, . . . ,N,xn+i = Ij where h = 1/N. The internal grid points can be con- 
sidered as centered around the control volumes Vi = where 

Xi+i = Xi + ^h, Xi_i = Xi — ^h. The values of the pressure and of the 
right hand side are defined at the grid points Xi and are denoted by Pi, fi- 
The values of the velocity are defined at the points 1 and are denoted by 

1 . Non-uniform grids can be treated in a similar way. Note, our approach 
is defined locally, at a particular control volume level, and it can work with 
standard vertex-based grids as well. 

The second-order discretization of the continuity equation in (I I . I p is 
straightforward : 

1 r^+i 

Ui+i - Ui_i = h Pi, pi = - f{x)dx, 1 = 1,2,..., N. (2.1) 

““-i 

Next, consider the Darcy law, rewritten in the form 

dp u(x) 
dx k{x) ’ 

and integrate this expression over the interval {Xi,Xi+i): 



- {Pi+i - Pi) = 






u(x) 

k(x) 



dx. 



( 2 . 2 ) 



We assume that the velocity u{x) is two-times continuously differentiable on 
the interface, so it can be expanded around the point Xi.^.i in Taylor series: 



U{X) = Ui+l 



+ (x-x,+ i) 



dui + 1 
2 

dx 



{X - Xi+l^)"^ d’^ujif} 

2 dx“^ 



T] G (Xi,Xi+i). (2.3) 



After replacing the first derivative of the velocity at by a two-point 

backward difference, we get the following approximation of (E2J: 



-{Pi+i-Pi) = M.+ i 




dx 

k{x) 




jx-Xj+i) 

k{x) 



dx + 0{h^). 
(2.4) 



Finally, we rewrite this equation in the following basic form: 



,H Pi+l ~ Pi I / ^ I / 

-K+i r = Ui+i -ka,+ i(u.+ i -Ui_l) + ^pi, 



(2.5) 



where ipi = 0{h?') and 



= I -H 



1 dx 



k{x) 



1 f^i+^X-Xi+l 

Oi+i = kf 1 — / — dx. 

Jxi k{x) 



( 2 . 6 ) 
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Here k^i is the well-known harmonic averaging of the coefficient k(x) over 
the cell (xi,Xi+i), which has played a fundamental role in deriving accurate 
schemes for discontinuous coefficients (see, e.g. 001171). This presentation 
of the velocity u{x) is a starting point for our discretization. Since we have 
assumed that the velocity is smooth, then the consecutive terms in the right- 
hand side in (E3D are 0(1), 0{h) and 0{h^), respectively. Truncation of this 
sum after the first term produces the well-known scheme of Samarskii jSj with 
harmonic averaging of the coefficient. This scheme is 0(/i)-consistent for the 
velocity at the interface points and second-order accurate for the pressure 
in the discrete i7^-norm. Further in the text, we call this scheme harmonic 
averaging or HA scheme. 

Now we derive an 0{h?) consistent scheme for the velocity by disregarding 

only the '0j-term in (I2.t>ll . We denote by Pt the approximate pressure at the 

grid points in order to distinguish it from the exact values pi. Similarly, 

the approximate velocity is denoted by U~i, etc., where the sign ± 

'*'+2 ^2 

indicates the right and left values of the flux at the point. Note, that the 
exact fluxes are continuous, while the approximate fluxes may have different 
t/^i values. Thus, we get 



i.H Pi+l ~ Pi 


- -ka,+ i(C/,_^i 




(2.7) 


'■'i+i 


h 


,H Pi Pi -1 


= C-.+“,-4(C/-s 




(2.8) 




h 



In summary, the equations g.CT, and approximates the first-order 
system a with loeal truneation error 0 (h'^ ). Further, we refer to this 
approximation as a scheme with improved harmonic averaging or IHA scheme. 

Now we transform the discretization to a more suitable form. Subtracting 
(E3) from dZZI we get 



(l-ka,+ i -a,_ 





H Pi+l ~ Pi I ,H Pi ~ Pi-1 
h h 



(2.9) 



Combining this with the discretization of the continuity equation (O) , we 
obtain 




= P^■ ( 2 . 10 ) 
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On the other hand, solving the system (E2J, (EHI) for U 1 and U^i, we get 



= 



= 



‘1- r, 



Pi+1 — Pi 






- k 



H 



1 + fli+l — fli-l 






^l+l “ -Pj 



-A:, 



H 



P - P. 



1 + ai+i — a,_ 1 



P* - Pi-1 



- (l + o,+ i) 



( 2 . 11 ) 



The new scheme approximates the velocity with second-order accuracy, 
independently of the positions of the discontinuity of the coefficient k{x). 
The price we paid is the necessity to evaluate the expressions k^i,k^_i 
and ai^.i,a,_i with an error no larger than 0{h?). Let a point ^ where 
the coefficient k{x) is discontinuous be in the form ^ = Xi + 9h for some i 
and 0 < 0 < 1. Now we consider particular realizations of this scheme. The 
approximation of the integral in k^ i is done by splitting it into integrals over 
(xi,^) and and then applying the trapezoidal rule for each integral. 

This approach will produce an accurate enough evaluation of k^ i : 






k^-o 



i-e 



W + 1 



H+0 



( 2 . 12 ) 



Note, that fcj_o and fcj+o are known from the interface condition. 

Further, we continue with the second integral in We again split the 

integral into two integrals and apply the trapezoidal rule for each of the two 
integrals: 



1 

h? 






X - a;,+i) 


dx = 


1 


P {x - x^+i 


k(x) 


/i2 


lx, k{x) 


'9-Q.b 


0.5' 




1-9 / 0.5 


0 

1 


ki , 


2 Ui+i 



dx 



1 

h? 
-0.5 
%+o 



fXi+i (3. - x,+ l) 



k{x) 



dx 



0{h^). 



(2.13) 

In the case of a piece-wise constant coefficient k{x) these formulas are exact 
and reduce to 

-1 



= 



1 - e 

ki-\-l 



and a. I 1 = 



16l(l-6»)(fc. -A:,+0 



(2.14) 



2 (1 — 9)ki + 9kij^i 

Obviously, if the point of discontinuity ^ is a midpoint of the grid, i.e. ^ = 



^ 1 , then 0 = 1/2 so that itz. 1 411 reduces to 



k!i,_ =2(1 



w+1 



and a., 1 = 

‘T o 



1 / fc, — k: 



4 V fc, -I- ki. 
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Remark 2.1 Note that if f{x) = 1 then u"{x) = 0 and the local truncation 
error is zero. Thus, HA scheme reproduces exactly piecewise linear solutions, 
while the IHA scheme reproduces exactly piecewise quadratic solutions. 



2.2 Discretization of the Imperfect Contact Problem 



Discretization of imperfect contact problem in the case when interfaces are 
aligned with grid nodes, is studied in |S|. A harmonic averaging type of 
discretization for interfaces aligned with control volume faces has been di- 
scussed in Below we derive improved discretization for the case when 
interfaces are orthogonal to a co-ordinate axis. Consider the 1-D imperfect 
contact problem Cl, dllSJ- The discretization in this case is derived in a 
similar way as in the case of perfect contact, so we shall only list the final 
results. The second-order discretization of the continuity equation in dU) is 
almost the same as in the perfect contact case: 



U 1 - 17+1 = h ipi, (Pi = y 

’+ 2*2 ll 



1 r^+h 



f{x)dx, i = 1,2,..., N. (2.15) 



lx 1 



A second order consistent discretization to Darcy law is given by: 

Pi+l — Pi 






h •+- 

For piecewise constant coefficients we have 






(2.16) 



1 ={9- 0.5) + ^ ((1 - Ofh - 9^h+^) , 



where 



= fcj(l — 9)ha^ + kiyi{ki + 9ha{), k^i = 



1-9 9 

kiyi ki 



A+i = fcf 1 



k 



H 



+ 2 I ha,,i 



+ 1 



Note, in this particular case, we have second order of approximation for the 
velocity near the interface when 9 ^ 0.5, and third order of approximation 
for the velocity on the interface when 9 = 0.5. 



3 Numerical Experiments 



Numerical experiments were performed in order to study the behavior of the 
proposed discretization (IHA) and to compare it with the widely used discre- 
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tization with harmonic averaging of the coefficients (HA) . We approximately 
solve model problems with known analytical solution in order to assess the ac- 
curacy of the numerical solution. Three test problems have been considered. 
First, we solve a 2-D perfect contact problem with different permeabilities in 
4 subregions, and investigate the cases when the interfaces are aligned with 
the control volume faces (i.e. 6 = 0.5), and when the grid is not aligned 
with the interfaces (i.e. 6 yf 0.5). We show that IHA ensures second order 
convergence for the velocity in both cases, while HA is first-order accurate for 
the velocity in both cases. Both schemes are second-order accurate for the 
pressure, however the constant of convergence in IHA is much smaller. In the 
second test we solve a 2-D perfect contact problem with different permeabi- 
lities in 16 subregions. Superconvergence for the pressure (namely, accuracy 
of 0{h^)) is observed for this particular problem in the case of a constant 
permeability in the whole domain fl. This is due to some symmetry and some 
cancellation of reminder terms. Numerical experiments show that for smooth 
velocity the IHA scheme preserves this superconvergence for the pressure in 
the case of discontinuous piecewise constant coefficients, while HA does not 
preserve it. Again, IHA ensures second-order accuracy for the velocity, while 
HA is only first-order accurate for the velocity. In the third example we solve 
a imperfect contact problem. 

We should note that these computations do not guarantee high order of 
convergence for solutions with low regularity, for example, those produced 
by discontinuous coefficients and f{x) = 1. Such solutions belong to the 
Sobolev space iJ^+'>'(n), where 7 > 0 depends on the ratios of the jumps at 
the intersection points of the interfaces and can be very small. 

Example 1. 2-D perfect contact problem in a unit square with interfaces 
at X = ^, y = r] (4 subregions). The permeability is a piecewise constant in 
each sub-domain and takes values k = 1, 10“^, in the 4 subregi- 

ons, counting from left to right and from bottom to top. The exact solution 
in this case is: p®® = ^sin (^) (x — 0 iv ~ v) (l + x^ -I- Results from 
computations are presented in Table 1 and Table 2. The following notations 
are used: HA and IHA denote harmonic averaging based discretization and 
improved HA discretization for the case of aligned interfaces (i.e. 9 = 0.5), 
respectively. The notation ”0—” in front of the scheme’s notation is used 
when 9 0.5. Maximum errors for the pressure and their ratios are presen- 

ted in Table 1. It is seen that both schemes converge with second order in 
both cases: aligned and non-aligned interfaces. The non-monotonic behavior 
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of the convergence in the non-aligned case can be explained by the fact that 9 
varies from grid to grid, taking values larger or smaller than 0.5. In this way, 
the maximum value of the solution in different subregions is involved in the 
error estimate. Table 2 summarizes maximum errors for the velocity at the 
vertical interface (or at the nearest vertical line x = x,^i in the non-aligned 
case). It is clearly seen that IHA is second-order accurate for the velocity, 
while HA is only first-order accurate for it. Again non-monotone behavior of 
the convergence, related to alternating values for 9 on the consecutive grids, is 
observed. A monotone convergence will be observed for fixed 9, for example, 
on grids contains 12 x 12,42 x 42, 162 x 162 nodes. 



Example 1.2-D perfect contact problem with 4 subregions. 



Grid 


C= 5,^7= : 
HA scheme 


i, (aligned) 
IHA scheme 


? 3’^ 3’ 

^-HA scheme 


(non-aligned) 
5-IHA scheme 


12x12 


1.75d-2 


- 


3.34d-4 


- 


1.91d-2 


- 


4.48d-4 


- 


22x22 


5.97d-3 


2.9 


7.64d-5 


U 


9.56d-3 


2.0 


1.45d-4 


3.1 


42x42 


1.80d-3 


3.3 


1.94d-5 


3.9 


2.10d-3 


16 


4.02d-5 


3.6 


82x82 


5.03d-4 


3.6 


4.97d-6 


3.9 


7.03d-4 


3.0 


l.lOd-5 


3.7 


162x162 


1.36d-4 


5.7 


1.26d-6 


3.9 


1.64d-4 


13 


2.80d-6 


3.9 


322x322 


3.56d-5 


3.8 


3.17d-7 


10 


4.69d-5 


3.5 


7.19d-7 


3.9 



Table 1. Maximum errors for pressure and their ratios. 



Example 1.2-D perfect contact problem with 4 subregions. 



Grid 


HA scheme 


;, (aligned) 
IHA scheme 


? — siV — 3? 

^-HA scheme 


(non-aligned) 
5-IHA scheme 


12x12 

22x22 

42x42 

82x82 

162x162 

322x322 


4.20d-2 - 

2.14d-2 1.96 

1.08d-2 1.98 

5.40d-3 2.00 

2.70d-3 2.00 

1.35d-3 2.00 


1.18d-3 - 

2.73d-4 4.3 

8.07d-5 3.4 

2.20d-5 3.7 

5.77d-6 3.8 

1.50d-6 3.8 


2.19d-2 - 

2.83d-2 0.77 

5.62d-3 5.03 

7.11d-3 0.79 

1.41d-3 5.04 

1.78d-3 0.79 


5.26d-4 - 

1.27d-4 4.1 

5.30d-5 2.4 

5.44d-6 9.7 

3.93d-6 1.4 

3.77d-7 10. 



Table 2. Maximum errors for velocity and their ratios. 



Example 2. 2-D perfect contact problem in a unit square with inter- 
faces at cc = = 0.2, X = ^2 = 0.5, x = ^3 = 0.7 and y = rji = 

0.3, y = rj 2 = 0.6, y = ri 3 = 0.8 (16 subregions). The permeability is 
a piecewise constant function and in each sub-domain and takes values k = 
10 X {10, 10-3, 1, 10-^ 10-2, 10, 10-3, 1, 10, 10-3, 1, 10-3, 10-3, 10, 10-2, 1} in 
the 16 subregions, counting from left to right and from bottom to top with 
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exact solution p{x, y) = ^ {x - ^i) {x - ^ 2 ) {x - ^ 3 ) {y ~ Vi) {y ~ V 2 ) {y ~ Vs)- 
The computational results are presented in Tables 3 and 4. For comparison 
we have included also the results for the constant coefficient k{x,y) = 1 in 
the whole domain fl. Notation “case B” is used to denote columns with these 
results. One can observe superconvergence for the pressure in the continuous 
case, which is also exhibited by IHA in the case of discontinuous coefficients. 
Table 4 shows that IHA ensures second-order convergence for the velocity, 
while HA is only first-order accurate. In our numerical experiments the exact 
solution is zero at the interfaces, so the absolute values for the error there 
might be small, compared to absolute errors far from interfaces, while the 
relative errors can be very large. We can plot the relative error 
computed by HA and IHA, respectively. Here (xi,yj) are the centers of 
the cells in the rectangular grid. Qualitatively, the results look similar, but 
quantitatively they differ by orders of magnitude. The results show that IHA 
produces a considerably more accurate solution near the interface. 

Example 3. 1-D imperfect contact problem with interfaces at = 0.2 
and ^2 = 0.7 with an exact solution: 



p{x) = 



d 

k{s 1) (s -|- 2) 







+ ax + b. 



The parameters k,c,d vary for the different subregions. The constants in the 
imperfect contact interface conditions are ai = 10^, 0:2 = 10. Also, ■u(O) = 1, 
m( 1) = 0, and s = {1, 1, 1} , A: = { 1, 10, 10^} , c = { 10, 1, 10^} , in the three 
subregions, counting from the left. 



Example 2. 2-D perfect contact problem with 16 subregions. 



Grid 


k{x,y) = 1 
case B 


alig 

HA scheme 


ned 

IHA scheme 


12x12 

22x22 

42x42 

82x82 

162x162 

322x322 


1.75d-3 - 

2.89d-4 6.06 

4.18d-5 6.91 

5.64d-6 7.U 

7.33d-7 7.69 

9.34d-8 7.85 


3.91d-3 - 

1.49d-3 2.62 

5.07d-4 2.93 

1.55d-4 3.27 

4.39d-5 3.53 

1.19d-5 3.69 


1.65d-3 - 

2.87d-4 5.75 

4.17d-5 6.88 

5.63d-6 7.41 

7.32d-7 7.69 

9.35d-8 7.83 



Table 3. Maximum errors for pressure and their ratios. 



Example 2. 2-D perfect contact problem with 16 subregions. 
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Grid 


k{x,y) = 1 
case B 


alig 

HA scheme 


ned 

IHA scheme 


12x12 

22x22 

42x42 

82x82 

162x162 

322x322 


1.74d-4 - 

5.85d-5 2.97 

1.67d-5 3.50 

4.42d-6 3.77 

1.14d-6 3.88 

2.89d-7 3.94 


2.02d-3 - 

1.17d-3 1.73 

6.20d-4 1.89 

3.18d-4 1.95 

1.60d-4 1.99 

8.06d-5 1.99 


9.37d-4 - 

2.91d-4 3.21 

8.22d-5 3.54 

2.39d-5 3.44 

6.56d-6 3.62 

1.75d-6 3.75 



Table 4. Maximum errors for velocity and their ratios. 



Example 3. 1-D imperfect contact problem with 3 subregions. 







PRESSURE 






VELOCITY 




Grid 


HA scheme 


IHA scheme 


HA scheme 


IHA scheme 


12x12 


4.56d-3 


- 


2.47d-3 


- 


3.87d-2 


- 


2.49d-2 


- 


22x22 


1.33d-3 


3.4 


5.12d-4 


4.8 


9.70d-3 


4.0 


5.46d-3 


4.6 


42x42 


3.58d-4 


3.7 


9.77d-5 


5.2 


2.43d-3 


4.0 


l.lld-3 


4.9 


82x82 


9.24d-5 


3.9 


1.83d-5 


5.3 


6.07d-4 


4.0 


2.50d-4 


5.4 


162x162 


2.35d-5 


3.9 


3.27d-6 


5.6 


1.52d-4 


4.0 


3.55d-5 


5.8 


322x322 


5.92d-6 


4.0 


5.84d-7 


5.6 


3.80d-5 


4.0 


6.04d-6 


5.9 



Table 5. Maximum errors and their ratios. 



The results from computations are presented in Table 5. As it was men- 
tioned in Section 1^1 the IHA scheme approximates velocity at the interface 
with third-order accuracy in the case when the imperfect contact interface is 
aligned with cell faces. In this case, the HA scheme approximates the velocity 
at the interface to second order. This may explain the second-order conver- 
gence for the HA scheme, and the superconvergence for the IHA scheme. 
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Hyperbolic Integral-Differential Equations 
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Abstract 

The finite element method and its analysis for pseudo-hyperbolic 
integral-differential equations with nonlinear boundary conditions is 
considered. A new projection is introduced to obtain optimal con- 
vergence estimates. The present techniques can be applied to treat 
elastic wave problems with absorbing boundary conditions in porous 
media. 



KEYWORDS: pseudo-hyperbolic integral-differential equation, finite element, 
Sobolev- Volterra projection, convergence analysis 

1 Introduction 

We consider the finite element method for the pseudo-hyperbolic integral- 
differential equation subject to the initial and nonlinear boundary conditions 

q{u)utt = V • {a{u)'S/ut + b{u)'S/u 

+ /o c(M(r))Vw(r)(ir)-b/(M), x € n, t € J, 

+ Jo dr = g{u), x e 50, t e J, 

m(x, 0) = = 'I'(x), X € fi, 

where J = [0,T], O C (d > 1 is the dimension of the space) is an open 
bounded domain with smooth boundary 50, 7 denotes the outer-normal 
direction to 50, = 4>{x,t,u), c{u{t)) = c(t,T,u{T)) = c(t,T,x,u{x,T)), 

and q, a, b, c, /, g, <f>, and 'll are known functions. We also assume that the 
functions q, a, b, c, /, and g are smooth with bounded derivatives and there 
exist constants g» > 0 and a* > 0 such that q{x, t, ip) > q* and a{x, t, ip) > a*, 
Vx € t € J, Ip G R. 

Pseudo-hyperbolic integral-differential equations are often used in the 
fields of visco-elasticity, nuclear physics, and biological mechanics. For exi- 
stence, uniqueness, and continuous dependence of solutions, there are some 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 104-^^^ 2000. 

(c) Springer-Verlag Berlin Heidelberg 2000 
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investigations carried out in PI3; however, for their numerical approxima- 
tion, there is few works available Since the antihunt, diffusion, and me- 
mory (Volterra) term emerge at the same time in their formulation, together 
with nonlinear boundary conditions, the traditional Ritz projection 0, Ritz- 
Volterra projection |2|, and Sobolev projection |3j can no longer reflect this 
global property, so it is difficult to do an error analysis. In this paper we 
introduce a new projection to treat these difficulties, use integration by parts 
and an induction hypothesis reasoning to deal with the nonlinearity of these 
problems, and do a thorough and successful finite element (FE) numerical 
analysis for (1.1). 

An outline of the paper is as follows. In §2, a new projection is intro- 
duced and its approximation properties are studied. With the help of this 
projection, the FE method and its numerical analysis is investigated and the 
optimal convergence is established in §3. 

In this paper K is a generic positive constant and may be different each 
time it is used and e is an arbitrarily small constant. Let {4>,'4’) = Jq 
and {4>,ip) = JgQ(j)tjjdx', the norms in Banach spaces follow those in [4, 5]. 
We use the inequalities 

ab < to? + ^6^, |^|| 2 (an) < e|l WIP + K{e)\\(j>\\'^, 

\c^\r<K{T,r)MUi_, 0<r<3/2, ry^l. 

2 A New Projection 

Let p, C be a finite element space associated with a quasi-regular 

partition of 12 such that the elements have diameters bounded by h. Let 
the index of p be the integer k. It is frequently valuable to decompose the 
convergence analysis of the FE method by passing through a FE projection of 
the solution of the differential problem. To treat our problem, we introduce 
a new projection: u{t) : [0, T] — )> /i such that 

{a{u)y{ut — Ut) + b{u)V{u — u) -I- f* c(u(t))V(u — u)(T)dr, Vv) 

+A(ut — Ut, v) — (g(u) — g{u),v) = 0 \/v G p, t G J, (2.1) 

m ( 0 ) = ^h, 

where A is a positive constant assuring coercivity of the form and 4>/i is an 
appropriate approximation of 4> in /i. 

Lemma 2.1 There exists a unique u{t) G p such that (2.1) being satisfied. 
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Proof: We use an iterative procedure. Let G /i and define {V^} by 

(a(u)V(V"/ - ut) + b{u)V{VP - u) + f* c(u(t))V(VP-^ - u)(T)dr, Vv) 
+A(Vt^ - ut,v) - {g{VP-'^) - g{u),v) = 0 Vv G t G J, 

VP{Q) = ^h, Vp > 1. 

(2.2) 

For p > 1, the unique existence of VP comes from the general theory of initial 
value problems of ordinary differential equations. Setting ZP = — VP 

and V = ZP , it is natural to see from the above relation that 

a*||VZf f + A||Zf |p< K[\\VZPr + \\ZP-^l + /J || VZ^>-i(r) f dr] 
+a4VZPf+l\\ZPf/2. 

Using 11^^ Ilf < K Jg ||Zf (T)llfdr and Gronwall’s lemma, there exists a con- 
stant iV* > 0 such that 

llZfllf < K, f \\ZP-4l{T)dr < Vp > 1. 

do P- 

Similarly, we have a upper bound for HU/Hi. Hence, {VP} is a Cauchy 
sequence in g and so there exists a unique u G /i such that VP ^ u and 
vr — >■ Ut, as p — >■ oo. Lemma 2.1 is completed by letting p — >■ oo in (2.2). [] 
Let condition (Pi) denote this: ||d) — <I)ij|| -|- /i||<I) — d);i||i < iGd*+^||<I)||fc+i 
and condition (P 2 ): |d> — d>/j|_i < iG/i*+^||<I)||fe+i. Let g = u — u and G = 
fo ffu(su + (1 — s)u)ds. (2.1) is represented as 

{a{u)Wgt + b{u)Vg + f* c(u(T))Vg(T)dT, Vv) 

+A(gt, v)— < Gg, v >= 0 Vv G p. 



Lemma 2.2 Under condition (Pi), there exists a K = K{u) such that: (1) 

= EII^II<^(lklllA+l + ll‘I>IU+l)^^•(2) ^f 
1=0 

|^GP'=+1(U),J = 0,1,2, x:i|^||<iV(|H|2.fe+i + ||<I>|U+i) /i^■(3) zf 

j=o 

^ G P'=+i(f^), J = 0, . . . ,3, E ll^ll < iV(||v||3.fe+i + mk+i)h4 (4) tf 

1=0 

^ G P'=+1(U), J = 0,. . .,4, E ll^ll < iV(||v||4.fc+i + \mk+i)h\ where 

1=0 

\\m\ls = E (ll^llf + fo f-WWldr), r > 0. 

j=o 
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Proof: Let Ru{t) : [0,T] p, he the Ritz projection of u. From (2.3) and 

the inequalities in §1, we see 

a,||V? 7 (p + A||? 7 t|p< {a{u)Vrit + b{u)Vr] + J* c{u{T))Vri{T)dT,V{ut - Rut)) 
+A(? 7 t, ut - Rut) - {G-q, ut - Rut) 

-(6('u)V? 7 + /g c(u(r))V?7(T)dT, V?7()+ < Gq,qt > 

< tl|Vr,*|P + |||r7dP + e|k||? 

+K[\\q\\l + Jo ||V? 7 (r)|pfir+ ||m( - RutWf]. 

Noticing that q{t) = 77 ( 0 )+ qt{T)dr and 77 ( 0 ) = and using Gronwall’s 

lemma, we get the first conclusion of Lemma 2.2. 

Differentiate in time (2.3) and set A = at{t, u{t)) + b{u), B = bt{t, u{t)) + 
c{t,t,u{t)), G = Ct{t,T,u{T)), D = gu{t,u), and E = J^gy,{t,su+ (1 - 
s)u)ds + Ut fg guu(i, sm + (1 — s)u)ds. Then we derive at 

(a(u)V77tt + ^V77t + BVq + f* GVq(T)dT, Vv) 

+X{qtt,v)- < Dqt + Eq,v >=0 Vw € 77 . 

Similarly, 

a.wyvttr + Mlrntf < t ||V77„f + IWqur + 4vu\\l 

+K[\\qt\\l + II 77 II? + /o l|V77(r)f dr + ||u„ - RuuWl]. 

Hence the second conclusion holds by using the first one. [] Differentiating 
in t (2.4) similarly, we can obtain the third conclusion. The forth conclusion 
can be gained by a further differentiation sequence. [] 

Lemma 2.3 Under conditions (Pi) and (P 2 ), there exists a K = K{u) such 

that (1) J = 0, 1, E III? II < if(||77||i.fc+i + ||<I>|U+i) h<^+^; 

7=0 

(2) 7/ e J = 0 , 1 , 2 , E Il|?|| < K{\\uhy +1 + ||4>IU+i)/7'=+S- 

7=0 

(3) z/|^GP'=+ 1(H), E |||3||<iL(|Hl3,fc+i + ||<i>|U+i)/7'=+S7 = 0,...,3; 

7=0 

(4) P'=+i(H), E < if(||u||4.fc+i + \mk+i)h'^^\ J = 0, . . . , 4. 

7=0 

Proof: Define a G in the way 

(a(u)Va + b{u)Vq + f* c(u(T))Vq(T)dT, Vv) 

+A(a,v) — {Gq,v} = 0 Vv G H^(fl). 



( 2 . 5 ) 
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By subtracting (2.5) from (2.3), we see that Ut is the standard Ritz projection 
of Ut — a onto fx. Hence, by the elliptic regularity of (2.5) and Lemma 2.2, 
we see that 

\M < ll«ll + KhihtWi + llalli) < ||a|| + Kh>^+\\\uh,,+i + ||d>|U+i). (2.6) 

It remains to estimate ||a||. To do this, let /? G be defined by 

(a(w)V /3 + 5(u)Vt7 + f* c(u(T))Vy(r)dT, Vv) 

+A(j3,v)- <G jj,v >= (a,v) Vv € 

Setting V = /3 in (2.5) and v = a in (2.7) and denoting z = a — j3, we find 
that 

l|af = - (?7,V- [6(u)Vz]) +/g{< > 

-(7?(r), V • [c{u{t))V z{t)])}dT- <Gr],z> 

< K{\tj\‘^_, + ||?7||2 + /p [|77(r)|2 1 + ||?7(r)||2]dT} + e(|z|| + \\z\\l + |z||), 

2 2 2 2 

while subtracting (2.7) from (2.5) leads to ||z ||2 < -ff||a;||, so we obtain 

||a|| < KM0)\_. + h(0)|| +£[h,(r)|_. + ht{r)\\]dr}. (2.8) 

Now, we turn to estimate |? 7 t|_i. For this purpose, we define V G in 

such a way 

(a(M)V-d + b{u)Vr] + f* c{u{T))Vr]{T)dT, Vv) 

+A(-d,v)— < Grj,v >=< S,v > Vv £ 

where S G |5| i = i , and < 6, ijt >= Let (p = V — a. It 

follows from (2.5) and (2.9) that 

(a{u)Vip,Vv) + \{ip,v) =< S,v >, Vv £ (2-10) 

Hence, \\pW 2 < K\rit\_i. Setting v = r]t in (2.10) and using integration by 
parts, we obtain 

= {a{u)V'qt + b{u)Vr] + J* c{u{T))Vrj{T)dT,V{(p - Rp)) 

+Hvt, - Rv)~ <Gr],(p- R(f> - <r), b{u) > +(77, V • [b{u)Vp]) 
~/o{< > ~{t]{t),V ■ [c(u(r))V(p(t)])}cir+ <Gr],(p> 

< + Ml + fo MMWdr] + + h(0)f ] 

+K Sl[\Vt{T)?_i + WritMVdT + e\\ip\\l. 
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Applying Lemma 2.2 and Gronwall’s lemma to above inequality, we see that 

Iryd- J + ||<I>|U+i) +kJ^ h*(r)f dr. (2.11) 

Thus (2.8) yields a upper bound for ||q;||. Substituting this bound into (2.6), 
we see again from Gronwall’s lemma that 

\M<Kh'^+H\\u\\i,k+i + \mk+i). 



Thus the first conclusion of Lemma 2.3 is valid as a consequence of ||? 7 || < 

^[h(0)ll +/o \\Vt{T)\\dT]. 

To demonstrate the second conclusion, we introduce the auxiliary equa- 
tion 

(o(m) Vcr -h AVijt + BVt] + f* CV?i(T)dT, Vv) 

+A(a,v)— < Drjt + Ef],v >= 0 Vv € H^(fl). 

Subtracting (2.12) from (2.4) implies that Utt is the Ritz projection of Utt — cr 
in /i. Hence, by the elliptic regularity of (2.12) and Lemma 2.2, we have 

huW < M+Kh{\\pu\\i + \\ah) < M+Kh'^+^iWuh .fc+i + ll'i’llfc-i-i)- (2-13) 



To estimate ||ct||, we define ( € such that 

(a(w)VC -f AVrjt + BVr] + f* CVp(T)dr, Vv) ^ 

-|-A(C, v)— < Drjt + Er], v >= (ct, v) Vv G 

Setting V = ^ in (2.12) and v = a in (2.14), denoting y = a — and using 
integration by parts, we derive at 

\wr<K{\vtt, + \yt, + Hr + \\vr 

+ /o[WT)P_i+l|r7(T)f]dr} + e||y||i. 

Note that subtracting (2.14) from (2.12) results in ||y ||2 < A"||cr||, so from 
(2.15), (2.11), and the first conclusion, we can show that 



Ikll < A:(||u||i,fc+i -h ||4 >IU-hi). 

Now, by (2.13), we obtain the second conclusion of Lemma 2.3. Differentia- 
ting in time (2.4) for one and two times, we can derive the third and forth 
conclusions. [] 

It is easy to show that 




no 



Cui 



Lemma 2.4 Under conditions (-Pi), (P 2 ), (md k > d/2, there exists a K = 



K(u) such that: 


(1) (1) 


■-f dH 
V dp 


i G ij'=+l(n), j 


= 0,1, then |||^||l« 




>) + 


ll^lUo 


0 ( 1,00 


) < 


K; (2) 


■f d^u 

V dP 


■ G j = 


= 0,1,2, then IlglUo 




b + 


II^IU= 


0 ( 1,00 


)< 


K; (3) 


■f d^'u 

y dp 


G P'=+l(n), j = 


0, ...,3, t/ien II I^IIlc 


"(Lc 


») + 


II^IU= 


0 ( 1,00 


)< 


K; (4) 


■f 9^11 

y dp 


GH>^+Hn),j = 


0,...,4, </ien ||f^||i,c 




.) + 


II^IU= 


0 ( 1,00 


) < 


K. 













Remark. If we set c = 0 in (2.1), then the projection becomes a Sobolev 
projection. Since formulation (2.1) also includes the Volterra term, we call 
the new projection Sobolev- Volterra (SV) projection. We see, on one hand, 
from the above lemmas, that the SV projection has the same priori appro- 
ximation properties as the Ritz, Ritz- Volterra, and Sobolev projections. On 
the other hand, it reflects the global property of the combination of antihunt 
term, diffusion term, and memory term, so it may be very convenient to the 
FE numerical analysis for the pseudo-hyperbolic integral-differential equation 
and pseudo-parabolic integral-differential equation. In the next section, we 
apply the SV projection to simplify the FE analysis for problem (1.1). 



3 The FE Analysis 

Define the continuous-time finite element scheme: finding U{t) G /i, such that 

{q{U)Utt, v) + {a{U)VUt + h{U)VU + f* c(U(r))VU(T)dT, Vv) 

= (/(C^), ^)+ < > yv G fi,tG J, (3.1) 

U = ^h, Ut = ^h, t = 0, 

where is an approximation of 'I' in /i, c{U{t)) = c{t,T,x,U{x,T)), and 
q{U) = q{x,t,U). 

Theorem 3.1 Under conditions (Pi), (P 2 ), k > d/2, and ^ G 
j = 0, 1, 2, for scheme (3.1), then 

\\Ut — Ut\\L°°(L^) + \\U — ^111100(112) 

+h[\\Ut — Ui||l2(hi) + \\U — u||iioo(iii)] = 0(h''+^). 




Finite Element Analysis for Pseudo Hyperbolic Equations 



111 



Proof: Let ^ = U — u. Then U — u = ^ — rj. Manipulation of (1.1), (3.1), 

and (2.1) leads to the error equation 



{q{U)^tt,v) + {a{U)V^t + b{U)V^ 

+ /o c(C^(T))V5(T)dr,Vw) = ([g(w) - q{U)]utt 
+q{U)r]tt + [/([/) - f(u)] - Xr]t,v) 

+([a(u) - a{U)]Vut + [6(m) - b{U)]Vu + Jq[c{u{t)) 
—c{U{T))]X/u{T)dT, Vw)+ < g(U) — g{u),v > Vv G p, t G J, 
^(0) =0,6(0) = tii(O). 



Let u = 6 and estimate (3.2) term by term; we see that 

+ (a* - e)||V6f < K[\\gr + + httV 

+ /o MrWdr] + K[\m + ||6IP + Jo UirWidr] + e|16||?. 

Integrating this inequality for t from 0 to s, 0 < s < T, and using Gronwall’s 
lemma, we obtain Theorem 3.1. [] 

Divide [0,T] into M equal intervals, and let At = T/M, ti = tAt, t;_|_i = 
(1 + i)At, = (j){ti), dtcj)'- = - <!>''), dt4>^ = 

du4>'' = = i(^*+i+(^')^ and0^ = 

We have ^ 

4'{T)^{T)dT = At(/)(t,+ l )6'’^5 + £,((/), '(/')• 

A discrete-time finite element scheme can be defined: Find [/"+^ G p such 
that 



{q^(U)dttU", v) + (a”(t/)Vat[7" -h 6”([/)VC/” + At J2 c„/(C/)V[/'+5 , Vv) 
= if'iU),v) + {g^{U), v) Vu G /r, n = 1, 2, . . . , 

(3.3) 

where c„i{U) = c{tn,ti+r^x,U^^^) and q^{U) = q{x,tn,U”‘). Let = 
jjn _ Then [/” — m” = — 77"; we have such an approximation result. 



Theorem 3.2 Under conditions {Pi), {P 2 ), k > d/2, ^ G iL^+^(n), j = 
0, . . . , 4, and 

ll'^7?°lli + ll?°lli = 0(/r'=+^ + (At)2), (3.4) 

for scheme (3.3), then, 

\\dtt{U — + \\dt{U — u)\\l<=o(l^) + \\U — 

+h[\\dt{U - u)IUoo(^i) + \\U - ulUoo(ffi)] = 0(/i'=+i + (At)2). 
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Proof: We see from ( 1 . 1 ), ( 3 . 3 ), and ( 2 . 1 ) that ^ satisfies that, for n > 1 

and Vw € /r. 



x: Lf = + (a’^(i/)VdtC,Vv) 

= \[q^(u) - g^(U)]utt + g”(t/)[« - dttu^) + 5„77”] 

+ [ r ( u )- r ([/)]- Ayr , y ) 

+([a"(u) - a"(C/)]Vu” + a"(U)(Vu^ - VdtU^),Vv) 
-(&"([/) VC” - [&”(u) - 6 ”(C/)]Vm”, Vu) 

n— 1 n— 1 

-(At X) c„i(C/)VC'+5 + At X] [Cni(M) - C„i([/)]V{t '+2 

z=o /=0 

n—1 5 

- Z ei ( c , Vu ) it = t „, Vv )+ < g "( U ) - g "( u),v >= Z Ji ?, 

Z=0 2=1 



( 3 . 5 ) 



where Cni(u) 
that 



and 



c(tn,ti_f_i,x,u(x,ti_f_i)). Setting v = in ( 3 . 5 ), we see 



N-l 

2At Z - K\\VdteV 

n—1 

- KAt ' za + h -^ idtem ^ dter . 

n—O 



( 3 . 6 ) 



N-l 5 

2At ZER?< K[\\e-Ti + lie'll? + llrf*e°ll? 

n—1 i—2 

+ hlli~(L^) + htlli.(L.) + (At )1 + 



N-2 , N-l 

+KAt z [iM*e”ii? + (i + /i-^ii'ite”ii)iivdtC”ip]+ E iirii?, 

n—1 n—1 



( 3 . 7 ) 



N-l 



where and ^ ^) = ^) — 

n—1 

N-2 

{ 4 , tp 4 + E ( e *" ~ f /'") have been used to accomplish the steps of 

n—1 

( 3 . 7 ). It is easy to estimate other terms. Applying these relations to ( 3 . 5 ) 

n— 1 

and using ||C”||f < if||C°||? + E IMtC'll? and 

1=0 

N-l N-l 

+KAt Z ll ^^ te"|p + eAt E ll ^ tte "| p ) we obtain , for N > 3, 

n—0 n—1 



N-l 

At E \\dtter + \\dte-Ti<N\\dte\\i+\\e\\i 

n—1 

+(At )4 + + htiii.(i.) + h«iii.(i.)] 

N-2 N-2 , 

+K2At z iidtrii? + ^3At E (i + /i-5|MtC”ii)iivdtrf- 

n=0 n=0 



( 3 . 8 ) 
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For N = 2, we obtain (3.8) by estimating i?"(t = 2, ... ,5) directly. Under 
condition (3.4), the first term on the right side of (3.8) is less than 
+ (At)‘^]. We obtain Theorem 3.2 using an induction hypothesis procedure, 
together with Lemmas 2.2 and 2.3. [] 

Letting v = duU"' in (3.3), we have the stability result. 

Theorem 3.3 Under the conditions of Theorem 3.2 for scheme (3.3), N > 
2, then 

Af-l 

At E \\duU^'^+\\dtU^-Ti + \\U^\\l 

n—1 

N-1 

< K[\\dtUT, + \\U% + At E II/”(C^)IP (3.9) 

n—1 

+At e' + \9^-\U)\hian) + \AU)\htan)]- 

n—1 

Another time-discrete finite element scheme is defined: Find 
G /r such that 

n—1 

(g"([/)atVF",w) -h (a"(t/)VVF -h &"(C/)VU” -h At E Cni{U)VU ^+2 ,Vv) 

= (/" (C/), u)+ < g^{U),v > Vu G n = 1, 2, . . . 
d^[/" = lF”+3, n = 0,1,2,.... 

(3.10) 

Denote w = Ut, w = Ut, p = w — w, and 0" = IF" — w'^. Then we have 

Theorem 3.4 Under conditions (Pi), (P 2 ), k > d/2, ^ G iL^+^(U), j = 
0, . . . , 4, and 



l|0°lli + ll^ii + IIC°lli = 0{h>^+^ + (At)2), (3.11) 

for scheme (3.10), then 

\\dt{W - w)||i2(ii2) -I- ||1F - -u;||icx,(i2) -I- \\U - u||ioo(i2) 

+h[\\W- + \\U - ulUoo(^i)] = 0{h>^+^ + (At)2). 

Proof: By (1.1), (3.10), and (2.1), we have the error equation 

{q^{U)dte^,v) + (a”([/)V0”, Vu) = ([(?”(u) - g”(U)]wt 
+(7”(C/)[« - 3iU>”) + dtp^] + [r(u) - /”(C/)] - Xp/,v) 

-n 5 

-f([a"(u) - a”(C/)]Vw" -f a”(C/)(VF;” - Vih ), Vw) -f E R?, 



(3.12) 
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where i?7(* = 3,4,5) follows the form in (3.5). Setting v = dtO'^ = — 

6'^~h)/At in (3.12) and estimating the terms in a similar way as that for 
Theorem 3.2, we obtain 



At\\d,0^^ + ||02||2 < KlpYi + At||0i||? + ||^°||f + + (Af)4], 

a/e 119*0-11^ + 110^11? + 110^-^11? < KiPYi + mi + IICl? 

n=l 



+(At)4 + ||ry||i^(^.) 

AT-l , 

+AT3At E [1 + = 

n=2 



N-1 



+ lklli.(L.) + llp*lli.(L.)] + ^2At E wni 

n—1 

(||r-^|| + ||r-i||)](||0"||f + ||r-i||?), iv>3. 



With an analogous inductive hypothesis reasoning procedure to Theorem 3.2, 
Theorem 3.4 can be proved. [] 



Theorem 3.5 Under the eonditions of Theorem 3.4 for scheme (3.10), N > 
2, then 

N-l 

At E WdtWY + WW^WI + \\w^-Ti + WU^WI + \\u^~Ti 

n—1 

< K[\\WY + + \\UY + At e' wnuw 

n—1 

N-l 

n—1 

From all above results, we see that the FE schemes proposed in this paper 
are all uniquely resolvable and have optimal and convergent properties 
to the original problem. 

Remark 1. In the theorems in §3, we assumed k > d/2, which is needed 
in Lemma 2.4. However, in the case of semilinear equations, e.g., a = a{x, t), 
b = b{x,t), and c = c{t,T,x), since Lemma 2.4 is no longer necessary in the 
numerical analysis, this restriction on k can be removed. 

Remark 2. In almost all of the lemmas and theorems in this paper, we 
used conditions (Pi) and (P 2 ). Practically, these assumptions are not difficult 
to satisfy, for example, provided that 4) G iJ^+^(H) and is given by 

(V(4> - 4>/i), V?;) + (4> - <^>h, v) = 0, Vu G fi. 

It is trivial that (Pi) and (P 2 ) are available Pj. 

Remark 3. The idea in this paper can be used to a more generalized 
case, e.g., when the V • {aVut + bVu + fg cVu(T)dT) term in equation (1.1) 
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takes a more generic form + bij-§^ + Jo where 

i,i=l 

0-ij = aij{x,t,u), bij = bij{x,t,u), Cij = Cij{t,T,x,u{x,T)), and A = (atj) is a 
dx d symmetric and positive-defined matrix. We can present similar schemes 
and obtain similar conclusions as we have done here. 

Acknowledgments. The author thanks Professor Yirang Yuan for his hel- 
pful suggestions, and Professors Longjun Shen and Xijun Yu for their warmly 
encouragement . 
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A CFL-Free Explicit Scheme with 
Compression for Linear Hyperbolic Equations 

Ronald A. DeVore Hong Wang Jiang-Guo Liu 

Hong Xu 



Abstract 

We develop an unconditionally stable, explicit numerical scheme 
for linear hyperbolic equations, which arises as an advection-reaction 
equation in porous medium flows. The derived scheme generates accu- 
rate numerical solutions even if large time steps are used, and conserves 
mass. Furthermore, this scheme has the capability of performing ad- 
aptive compression while maintaining the accuracy of the compressed 
solution and mass conservation. Numerical results show the strong 
potential of the method. 



KEYWORDS: characteristic methods, hyperbolic equations, multiresolution 
analysis, wavelet decomposition 

1 Introduction 

Hyperbolic partial differential equations describe the displacement of oil by 
injected fluid in petroleum recovery, the subsurface contaminant transport 
and remediation, aerodynamics, and many other applications. Because of 
the moving steep fronts present in their solutions, the numerical treatment 
of these equations often presents severe difficulties. In late 1920’s, Courant, 
Friedrichs, and Lewy proved the famous result that there are no explicit, un- 
conditionally stable, consistent finite difference schemes for hyperbolic partial 
differential equations 0. Consequently, most numerical methods developed 
for hyperbolic equations are explicit but subject to the well-known Courant- 
Friedrichs-Lewy (CFL) condition. Very small time steps often have to be 
used in numerical simulations to meet the stability requirement of the nu- 
merical methods. Implicit methods could be unconditionally stable, but an 
algebraic system must be solved at each time step in order to obtain nume- 
rical solutions. 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 116-^^^ 2000. 

(c) Springer-Verlag Berlin Heidelberg 2000 
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In this paper we develop an unconditionally stable, explicit numerical 
scheme for linear hyperbolic equations. The derived scheme generates ac- 
curate numerical solutions even if large time steps are used, and conserves 
mass. Furthermore, this scheme has the capability of carrying out adaptive 
compression while maintaining the accuracy of the numerical solution and 
mass conservation. Computational results are presented to show the strong 
potential of the method developed. 



2 A Reference Equation 



We consider the initial-value problem for the linear hyperbolic equation 



Ut + V ■ (vu) + Ku = f(x,t), (x,f) gIR'^x(0,T], 

m(x,0) = uq(x), X G K'^, 



( 2 . 1 ) 



where v(x, t) := (Vi(x,t), V 2 (x, t), . . . , Vd{x, t)) is a fluid velocity fleld, K{x, t) 
is a first-order reaction coefflcient, /(x, t) is a given source function, u(x, t) is 
the unknown function, and uq(x) is a given initial condition, x := (a;i, . . . , Xd)-, 
■= V := (gf^, . . . , gf^). Wc assume that mo(x) and f{x,t) have 

compact support, so the exact solution u(x, t) also has compact support for 
any finite time t > 0. 

We define a uniform partition on the temporal interval [0, T] by := nAt 
with At = T/N. If we choose the test functions w{x,t) to be of compact 
support in space, to vanish outside the interval (t„_i,t„], and to be discon- 
tinuous in time at time tn-i, the weak formulation for Eq. (2.1) is 



u(x,tn)w(x,t„)dx - 



/IR'^ 



/iR'^ 



u{x,tn-l)w{x,t+_^)d 



t„_i ./]R‘ 
-I- 



u(wt + V • Vw — Kw){x, t)dxdt 

rt. r (2-2) 



X 



/(x, t)w{x, t)dxdt, 



Itn-i -'IR'^ 



where w{x, tij^_i) '■= limj_j^j+ w{x, t) takes into account the fact that w{x, t) 
is discontinuous in time at time t„_i. 

Motivated by the ELL AM framework of Celia et aZ P, the test functions 
w in Eq. (2.2) are chosen from the solution space of the adjoint equation 
of Eq. (2.1). Let y = r(0;x,t), with i G be the characteristic 

determined by 

= v(y,6»), with y|e=t = x. 



dv 

dO 



(2.3) 
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Then the adjoint equation of Eq. (2.1) is rewritten as 

w{r{e;5i,t),9)\g^i =w 

leading to the following expression for w 

w{r{e-, X, i), 0) = w{5i, T)e~ 






(2.4) 



(2.5) 



Therefore, once the test functions rc(x,t„) are specified at time t„, they are 
determined completely and in fact vary exponentially along the characteristic 
r(0;x,t„) for 0 G 

To avoid confusion in the derivation, we replace the dummy variables x 
and t in the second term on the right-hand side of Eq. (2.2) by y and 0. For 
any y G there exists an x G such that y = r{0; x, t„). We obtain 



/t„_i Jtr'^ 



' 11 “ Jt„ 



f{y,O)w{y,9)dyd0 



f{r{0; X, tn), 0)w{r{0; x, t„), 6») 



dr{0;x,tn) 



dx 



d0dx 



F{x,tn)w{x,t„) 



/]R“ 






( 2 . 6 ) 



dx + Ei{f,w) 



/]R“ 



A(x, tn)f{x, tn)w{x, tn)dx + Ei{f, w) . 



Here A{x,tn) = (1 — e~^^^’*’''>^*)/K{x,tn) if K{x,tn) yf 0, or At otherwise. 
Ei{f,w) is the local truncation error. 

Incorporating Eq. (2.6) into Eq. (2.2), we obtain a reference equation 



/lR“ 



u{x,tn)w{x,tn)dy. 



/ ■u(x,t„_i)w(x,t+_Jdx -I- / K{x,tn)Uw){yi,tn)dx + E(w), 
J]R“ J]R“ 

where E{w) = J*" ^ fj^d u[wt + v • Vw — Kw]{x, t)dxdt + Ei{f, w). 



(2.7) 



3 Multiresolution and Wavelet Decomposition 

To develop a numerical scheme based on the reference equation (2.7), we 
need to define the trial and test functions at time t„. To do so, we recall the 
notions of multiresolution analysis and wavelet decompositions. 
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In the standard Fourier analysis, L^-functions are represented by linear 
combinations of sines and cosines. In 1910, Haar studied the representation 
of L^-functions by step functions taking values ±1 |^. In the 1980’s, these 
ideas were explored and developed further into the theory of wavelets. The 
first wavelets were introduced in early 1980’s by Stromberg m and Morlet 
et aZ jH]. One of the best ways of constructing wavelets is multiresolution 
analysis. This approach began in image-processing pm] and was introdu- 
ced into mathematics by Mallat p. Multiresolution analysis was used by 
Daubechies |2| to construct compactly supported orthogonal wavelets with 
arbitrary smoothness, which include the Haar wavelets as the simplest case. 

Definition A sequence of closed subspaces (Z-the set of all in- 

tegers) of L^(IR) is a Multiresolution Analysis if 

(a) these spaces are nested: Vj C Vj+i for Vj € Z; 

(b) these spaces are dense in L^(IR): UjgzVj = L^(IR) and fljgzVj = 0; 

(c) Vo is invariant under integer shifts: /(•) S Vo => /(• — fc) G Vo, Vfc G Z; 

(d) Vj is obtained from Vo by dilation: /(•) G Vj /(2“b) G Vo, Vj G Z; 

(e) Vo is generated by a single (the so-called scaling) (p and its translates: 
{(po,k ■ h G Z} is an orthonormal basis in Vo, where 

(pj^kix) := - k), j, Zc G Z. (3.1) 

The conditions (d) and (e) in the definition implies that the family {4>j,k ■ 
fc G Z} forms an orthonormal basis for Vj. Let Vj : L^(IR) — > Vj be 
the orthogonal projection operator, then the conditions (a) and (b) in the 
definition concludes that for any / G L^(IR) 

.hm \\V,f-fh2^^^=0, (3.2) 

j/— >- + oo 

i.e., 

'Pjf = XI ^ 3 ,k)(l>j,k — > f, as j ^ -koo. (3.3) 

kC'Z, 

Let Wj-i be the orthogonal complement of Vj-i in Vj 

Vj = Vj-l © Wj-l = ... 

= Vj © Wj © Wj+I © ... © Wj-1, 



for j > J. 



(3.4) 
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Then, if / G L^(IR), the telescoping sum 

j-i 

V,f = Vjf + Y,{Vi+if-V,f) (3.5) 

i—J 

represents Vjf G Vj as an element of Vj©Wj©>Vj+i © . . .©Wj_i. By (3.2), 
Eq. (3.5) provides an approach of approximating arbitrary L^-functions by 
sequences of functions from Vj and Wi for J < j < +oo. More importantly, 
the spaces Wj can be generated by a single (the so-called wavelet) function 
In other words, ip and its integer translates V'o,fc> with being defined 

by 

i’j,k{x) := — k), j,k G Z, (3.6) 

constitute an orthonormal basis for Wq- For each fixed j, the V'i.fc {k G Z) 
form an orthogonal basis for Wj. 



4 A CFL-Free Explicit Scheme 
with Conservative Compression 



In this section we develop a CFL-free, explicit scheme for the initial-value 
problem (2.1). We define the finite-dimensional space 5j(IR‘^) to be the tensor 
product of the one-dimensional space Vj 

5i,(IR ) := (Vj(lR))‘^ = span ; (4-1) 

where 4>i.k(x) := ■ ■ ■ 4>3,kA^d) and 4>j,k{x) G V,(1R) is defi- 

ned by Eq. (3.1). 

With these preparations, we replace the exact solution u in Eq. (2.7) 
by the trial functions U(jx.,tn) G 5'j(lR'^) and drop the error term E(w) in 
Eq. (2.7), leading to the following numerical scheme: find U(x,tn) G 5(1R‘^) 
such that 



U{x,tn)w{x,tn)dy 



/IR" 



[ C/(x,t„_i)ii;(x,t+_Jiix+ [ A(x,t„)/(x,f„)ii;(x,t„)dx, 



Vw(x,t„) = ^>j,k(x) G Sj(JR ). 



Although the integrals in Eq. (4.2) are formally defined on the space IR'^, they 
are in fact supported locally since the test functions w(x, t„) = 4)j^k(x) have 
compact support. Moreover, because the solution C/(x,t„_i) and /(x,t„) 
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have compact support, Eq. (4.2) only needs a finite number of operations at 
each time level. Thirdly, because the scaling functions 4)j.k(x), k G form 
an orthonormal system for 5j(IR'^), the scheme (4.2) is explicit. It can be 
proven that the scheme (4.2) is unconditionally stable. 

It is well known that in practice the solutions of linear transport equations 
are fairly smooth outside some very small (dynamic) regions, where the solu- 
tions can develop steep fronts or even shock discontinuities. Therefore, some 
kind of local refinement or adaptive techniques should be used. Extensive 
research has been conducted in this regard in the finite difference, finite ele- 
ment, and finite volume methods. In the current context, this can be realized 
in terms of compression in a very natural way. For example, we can apply a 
thresholding to the scheme (4.2) to minimize the number of equations that 
need to be formulated and solved, and so significantly improve the computa- 
tional efficiency. However, a direct application to the scaling functions 4>j.k 
will introduce mass balance error. In contrast, because of their vanishing 
moments, applying such a technique to the wavelets does not affect mass 
balance. Hence, we use the wavelet decomposition (3.4) to choose another 
basis for the space Vj(IR) 



Vj-(]R) = span 



a— 0,1 



(4.3) 



where := 4>i^k{x) and (j)"yl{x) := This leads to another ex- 

pression for the space 5'j(IR‘^) 



= span|4>["^(x)| 






(4.4) 



In this case, the numerical scheme is still defined by Eq. (4.2), but using 
a different basis for 5i,(IR‘^). We begin by projecting the initial condition 
Mo(x) onto the space 5j(]R'^) to obtain an approximant Uo(x,to)- For n = 
1,2, . . . , N, we apply a thresholding technique to compress Uo(x,tn-i) to 
obtain [/(x,t„_i). Then, using its wavelet expansion as an error indicator, 
we solve the reduced scheme (4.2) at time step with the minimal number 
of basis functions used to obtain f/(x,t„). Next, we compress (/(x,t„) to 
obtain C/(x,t„). We continue this process until we reach the final time step 
tN = T. 




122 



DeVore, Wang, Liu, and Xu 



5 Numerical Experiments 

In this section we present numerical results to show the potential of the 
numerical scheme developed in this paper. The spatial domain is = (0, 5), 
the time interval is [0,T] = [0,2], a velocity field V = 1 + 0. la; is chosen, 
K = f = 0. The initial condition uo(x) is 




Figure l:The initial condition uo{x). 



Uo{x) := Imp(a;; 1, 2, 0.05) - 0.2 Tri(a;; 1.5, 0.1), (5.1) 

with the hat function Tri(a;; a, b) and the impulse function Imp(x; a, b, c) being 
defined by 



Tri(a;; a, b) 



1 + 


X — a 


if 


a — b < 


X < a, 


h 

X — a 


1 - 


if 


a < 


X < a + b, 


b 



0 otherwise. 



(5.2) 



and 



Imp(a;; a, b, c) 




(5.3) 



where the error function erf(a;) := {2/^/n) e~^ dz. The initial condition 

uo(x) is plotted in Figure 1. 

In the numerical experiments, we use a time step of At = 0.4. The coarse 
and fine spatial levels are J = 3 and j = 7, respectively. This leads to a 
maximal Courant number of 192. At the final time T = 2, the error of the 
uncompressed solution is 0.00477 while the error of the compressed solu- 
tion is 0.00515. The compression ratio is 30.19. We plot the uncompressed 
and compressed solutions against the analytical solution in Figure 2. 
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Figure 2:The analytical, uncompressed, and compressed numerical solutions. 
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Abstract 

Computers today rely heavily on good utilization of their cache me- 
mory subsystems. Compilers are optimized for business applications, 
not scientihc computing ones, however. Automatic tiling of complex 
numerical algorithms for solving partial differential equations is simply 
not provided by compilers. Thus, absolutely terrible cache performance 
is a common result. 

Multigrid algorithms combine several numerical algorithms into a 
more complicated algorithm. In this paper, an algorithm is derived 
that allows for data to pass through cache exactly once per multigrid 
level during a V cycle before the level changes. This is optimal cache 
usage for large problems that do not fit entirely in cache. The numerical 
techniques and algorithms discussed in this paper can be easily applied 
to numerical simulation of fluid flows in porous media. 



KEYWORDS: multigrid, cache, threads, sparse matrix, iterative methods, 
domain decomposition, compiler optimization 

1 Introduction 

Multigrid methods are widely known as the fastest methods for solving elliptic 
partial differential equations. This belief was derived when computers were 
designed very differently than today. Accessing one word of data took a set 
amount of time due to computers having one level of memory. 

Since the early 1980 ’s, processors have sped up 5 times faster per year 
than memory. Multilevel memories, using memory caches were developed 
to compensate for the uneven speed ups in the hardware. Essentially all 
computers today, from laptops to distributed memory supercomputers use 
cache memories to keep the processors busy. 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 124-^^^ 2000. 

(c) Springer-Verlag Berlin Heidelberg 2000 
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(xo,yo) X 



{xn+i,Un+i) 



Figure 1: Simple grid with red points marked 



By the term cache, we mean a fast memory unit closely coupled to the 
processor. In the interesting cases, the cache is further divided into a great 
many cache lines, which hold copies of contiguous locations of main memory. 
The cache lines may hold data from quite separate parts of main memory. 

Tiling is the process of decomposing a computation into smaller blocks 
and doing all of the computing in each block one at a time. Tiling is an 
attractive method for improving data locality. In some cases, compilers can 
do this automatically. However, this is rarely the case for realistic scientific 
codes. In fact, even for simple examples, manual help from the programmers 
is, unfortunately, necessary. 

Language standards interfere with compiler optimizations. Due to the 
requirements about loop variable values at any given moment in the compu- 
tation, compilers are not allowed to fuse nested loops into a single loop. In 
part, it is due to coding styles that make very high level code optimization 
(nearly) impossible m- 

Before transforming the standard Gauss-Seidel algorithms into cache aware 
versions, let us define two operations for updating the approximate solution 
on either one or two rows of a grid: 

Update{ row, color [, direction] ) 



and 



UpdateRedBlack{ row, [, direction] ) 

We implicitly assume that both Update and U pdateRedBlack only compute 
on rows that actually exist. 

The operation Update does a Gauss-Seidel update in each of the columns 
of a single row in the grid. The color is one of red, black, or all. The direction 
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Standard 




Cache aware 


1 . Do j — 1,N 


1. 


Update{l, red). 


la. Update[j, red). 


2. 


Do j — 2, N 


2. Do j = 1,N 




2a. UpdateRedBlack{j). 


2a. Update{j, black). 


3. 


Update{N, black). 



Figure 2: Standard and cache aware Gauss-Seidel 



is optional and can be natural or reverse. The natural order is assumed unless 
a direction is given. Hence, symmetric Gauss-Seidel is easily implemented. 

The operation UpdateRedBlack operates on all of the red points {xi,yj) 
in row j of the grid and on all of the black points {xi,yj±i) in the preceding 
row j±l (the updates are paired). The ± depends on the choice of direction. 
This fuses the red-black calculation so that we do a red-black ordered Gauss- 
Seidel with only one sweep across the grid instead of the standard two passes. 

The update operations can be modified for SOR, SSOR, or ADI relaxation 
methods. Within the update operations we can further optimize the process 
by including Linpack style optimizations like loop unrolling to get 2 — 7 
updates per iteration through the loop. 

Some of the motivation behind this paper can be summarized in a simple 
example. Gonsider the grid in Figure 0 where the boundary points are 
included in the grid. Both standard and cache aware algorithms for the 
red-black ordered Gauss-Seidel iteration are given in Figure O 

Each iteration of the standard algorithm, all of the data passes through 
the cache twice. Hence, with a small change in the algorithm’s implementa- 
tion, the data only passes through cache once. Unfortunately, no compiler 
for commonly used languages (e.g., Fortran, G, or G-l— 1-) seems to exist that 
can optimize the first form of the red-black ordered Gauss-Seidel algorithm 
automatically into the second form. 

Multigrid algorithms combine a number of operations in order to work. 
These include iterative methods (typically relaxation methods), residual com- 
putation, projection of residuals onto a coarser grid, and interpolation of 
corrections onto a finer grid. These are typically programmed as separate 
routines, which makes the components easy to replace and modify. 

However, a number of components re-use data in a manner that is suitable 
for algorithms that are cache aware. Algorithms will be developed such that 
data passes through the memory cache once while computing on a given level 
before a level change. 



Maximizing Cache Memory Usage for Multigrid Algorithms 



127 



This paper is concerned with algorithmic changes that are highly por- 
table. Such techniques as loop unrolling, though mentioned, are not really 
stressed. The intention is that codes written using the algorithms in this 
paper will work well on anything from a PC to a high end RISC processor 
based supercomputer with only trivial tuning (one parameter). This means 
that we are not trying to get every last floating and flxed point operation out 
of a computer, just an integer factor speed up for a modest amount of work. 

2 Relaxation Methodology and Notation 

Consider solving the following set of problems: 

AiUi — y), 1 ^ ^ ^ 

where ui G IR"' and > n^+i. Level 1 is the real problem that the solution is 
wanted on. All other levels are smaller, or coarser, approximations to level 1. 
The linear systems result from discretizing a partial differential equation 
over a given grid The discretization can be any standard finite element, 
difference, volume, or wavelet approach (see Q and |2 for examples of this 
approach). Further, the discrete grids fli will be assumed to be structured 
and regular. 

2.1 Once through Cache Naturally Ordered Gauss-Seidel 

Consider the naturally ordered Gauss-Seidel first. Let us restrict our atten- 
tion to matrices Aj which are based on discretization methods which are local 
to only 3 neighboring rows of the grid (e.g., a 5 or 9 point discretization). 
We have to assume that f + m — 1 rows of a NxN grid G fit entirely into 
cache simultaneously and that m < £. 

Figure 0 contains the complete algorithm for passing data through cache 
only once for the naturally ordered Gauss-Seidel. There are two special cases 
to the cache aware algorithm: the first block of rows and the rest of the 
blocks. 

The first case is for the first £ rows of the grid. At the end of step 1 in 
Figure 0 the data associated with rows 1 to £ has been brought into cache 
only once, not m times. The data in rows 1 to £ — m -I- 1 has been updated m 
times. The data in rows j, £ — m + 2<j<£ has been updated £ — j + l times. 

Once the first block of grid rows is partially updated, we have a new block 
to update and must also finish updating the previous block of grid rows. This 
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Algorithm Cache-GSNat 



Do 


it = 0,m 


- 1 




la. 


Do i = 


- 1,£ — it 






lal. 


Update{i, all). 




Do 


j = £+l 


,N,L 




2a. 


Do it - 


= 0, m — 1 






2al. 


Set ji = min(j + £,N). 






2a2. 


If j + £ < N, then 








2a2a. j 2 = max(jii — it — 1, 


<3) 




2a3. 


Else 








2a3a. j 2 = N 






2a4. 


Do i = j,j2 








2a4a. Update{i, all). 






2a5. 


Do i = j — l,j + it — ra + 1, 


-1 






2a5a. Update(i, all). 





Figure 3: Once through cache naturally ordered Gauss-Seidel algorithm 

corresponds to step 2 in Figure0 Note that the second inner loop (step 2a3) 
runs in the opposite order as the first inner loop (step 2a2), which ensures 
that the updates are bitwise identical to the standard algorithm. In effect, we 
have applied a domain decomposition methodology to the standard iteration. 
Each iteration requires a new, slightly smaller domain. 

The cache aware algorithm can be generalized to grids which are too 
large to fit entire rows into cache. In this case, a multidimensional approach 
is necessary when dropping rows and columns from the subdomains. In each 
of the three pictures below, computation occurs in the smallest subdomain 
only (the outer parts represent the buffers that have been added) . The buffers 
grow by one line in each of x and y each iteration. 




While computing in neighboring subdomains, the rest of the updates are 
done carefully in order to maintain bitwise identical updates. In the current 
subdomain, the calculation is done using the natural ordering. In the trailing 
subdomain, the calculation is done in the opposite order. In the three pictures 
below, the shrinking subdomain on the left is the trailing subdomain. 
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Algorithm Cache-GSRedBlack 
1. Do j = 1, N, 1. 

la. Set ji = min(ji + £ — 1,N). 

lb. Do it = 0,m — I 

Ibl. Do i = j,ji — 2* it 

l bla. Update{i, red). 

l blb. Update(i — 1, black). 

Ib2. Do i=j — l,j — m + 2*it, —2 

lb2a. If mod(ji,2)= 1 then 

lb2al. UpdateRedBlack{i) 

Ib2a2. UpdateRedBlack{i — 1) 

lb2b. If mod(j,2)= 0 then 

lb2bl. UpdateRedBlack{i — 1) 

Ib2b2. UpdateRedBlack{i) 



Figure 4: Once through cache red-black ordered Gauss-Seidel algorithm 



2.2 Once through Cache Red-Black Ordered Gauss-Seidel 

Red-black ordered Gauss-Seidel is significantly more complex to code in the 
once through cache style rather than in the normal style. Let us restrict our 
attention to matrices Aj which are based on discretization methods which 
are local to only 3 neighboring rows of the grid (e.g., a 5 or a 9 point discre- 
tization). For a discussion of a 9 point discretization on this topic, see 0 
and [E]. 

Figure ^ contains the complete algorithm for passing data through ca- 
che only once for the red-black ordered Gauss-Seidel. In EH the naturally 
ordered Gauss-Seidel was shown geometrically to be similar to a domain de- 
composition method where the size of the subdomains shrank each iteration 
(forming an increasingly larger buffer each iterations). For the red-black or- 
dered Gauss-Seidel, the buffer is saw tooth shaped on the top and grows each 
iteration by points from two grid rows instead of one. On the right side, the 
buffer grows each iteration by only one grid column (similar to the naturally 
ordered Gauss-Seidel case). 
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Figure 5: Simple triangular grid 



2.3 Once through Cache ADI 
and Line Relaxation Methods 

ADI and line relaxation algorithms are easily made cache aware using similar 
techniques to the ones given in St |‘/!.l 12.21 One difference is that entire lines of 
unknowns must fit in cache at once in order to guarantee bitwise compatibility 
with the standard algorithm implementations. 

2.4 Once through Cache Relaxation Methods 
on Triangular Grids 

Triangular grids are also easily made cache aware. The grids must be quasi- 
uniform and highly structured and the ordering of Ai must lead to blocks 
of diagonal submatrices. A simple example of a suitable grid is given in 
Figure El A combination of the methods given in S ll'z!. It/!. '/!l with the nonzero 
graph of the A,’s is used. 

Grids that are unstructured, including ones that have been chosen through 
an adaptive gridding procedure, require a different approach than is covered 
in this paper. m briefly discusses an unstructured grid approach. 

2.5 Unstructured and Quasi-Structured Grids 

Unstructured grids presents a challenge that is not addressed in this paper. 
Quasi-unstructured grids (see Figure 0) can frequently be accommodated 
using techniques similar to structured grids. In particular, the number of 
graph connections in the matrices are usually predictable, just like in the 
structured grid case. Both of these cases are considered in 0, 0, and C]. 
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Figure 6: Quasi-structured grid. 



Algorithm Vcycle( k, fi,ri}i=i ) 

1. Do i = 1, 2, • • • , fc — 1 

la. Approximately solve AiUi = fi. 

lb. Compute a residual rt fi — Aim. 

l c. Set fi+i <— RiVi and m+i 0. 

2. Do i = fc, fc — 1, • • • , 1 

2a. Approximately solve Aim = fi- 
2b. If i > 1, then set m-i <— m-i + PiUi- 



Figure 7: V cycle definition 
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3 Combining Multigrid Components 

A typical multigrid method is based on a V cycle multigrid method (see 
Figured)- Implementing a W or F Cycle (or any other correction cycle) is a 
trivial extension. 

All multigrid correction algorithms are a simple combination of two di- 
stinct parts: the pre- correction step and the post-correction step. These are 
referred to by McCormick m as slash cycles. While both steps may have an 
approximate solve step included, the change of level step at the end of each 
has quite different cache effects. 

There are 3 major operations in the V cycle: 

1. Approximate solves: steps la and 2a. This is typically a relaxation 
method for simple problems, but can be any iterative method. 

2. Restriction of residuals: steps lb and Ic. This is typically a weighted 
average of nearby residuals. It is used to compute a residual correction 
problem’s right hand side (represented by the operator Ri) on the next 
coarser grid. 

3. Prolongation of corrections: step 2b. This is typically a second or fourth 
order interpolation method (represented by the operator Pi). 

In a typical multigrid code, a V cycle is implemented very similarly to the 
description given here. By using a structured language methodology, different 
algorithms can be substituted for the default ones trivially. 

As was shown in 0, when a grid fli gets too large, a change in the al- 
gorithm is required which changes the global ordering. In essence, we use a 
domain decomposition approach to find disjoint two dimensional subdomains 
idij whose union is 11^. Further, the data associated with the Gauss-Seidel 
operation on each subdomain must fit entirely in cache. The size of the sub- 
domains depends heavily on the number of nonzeroes per row in the matrix 
Ai, the sparse matrix storage method, and what iteration of the Gauss-Seidel 
iteration we are on. Hence, we really have a set of disjoint subdomains 
where £ = 1, • • • , m. 

Data passes through cache once almost everywhere each time a level is 
reached with this transformation. Due to connections between subdomains, 
sometimes a very few points (rather than whole subdomains) have data pass 
through cache twice. However, we can do better, as will be described in this 
and the next section. 
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Algorithm Cache-Vcycle( k, {Ai,Ui, fi,Ci}i^i ) 

1. Do i = 1, 2, • ■ • , fc — 1 
la. Do £ = 1, 2, ■ • ■ , mi 

lal. Determine 
la2. For each 

la2a. If £ = 1, then m 0. 

la2b. Do 1 iteration of approximate solve of AiUi = /i. 

Ia2c. If £ = mi, then compute as much of Vi as is possi- 
ble. 

Ia3. If £ = mi, then complete the calculation of Vi 

and calculate fi+\ = RiVi. 

2. Do i = fc, fc — 1, • ■ • , 1 
2a. Do £ = 1, 2, ■ • ■ , mi 

2al. Determine 
2a2. For each 

2a2a. If £ = 1 and A: = 1, then Ui 0. 

2a2a. If £ = 1 and A: > 1, then 

Ui Ui -fPi+lUi+l I^P). 

ij ij ij 

2a2c. Do 1 iteration of approximate solve of AiUi = fi. 

Figure 8: Cache aware V cycle definition 

The computational subdomains must be further refined in order to 
use as large of subdomains as possible each iteration of the relaxation algo- 
rithm. The last iteration of the relaxation algorithm must be treated diffe- 
rently due to the projection or interpolation steps that must be done. As is 
noted in El , only 50-60% of the cache is actually available for use by a given 
program. This is a side effect of multitasking operating systems. 

Determining the sizes of the s per iteration £ can be done as a pre- 
processing step and is inexpensive. In order to efficiently do loop unrolling 
and/or tiling, we need a small amount of information about the computer 
that we are going to use. First, we need to know how big the usable part of 
the cache actually is. Knowing the size of usable cache and the number of 
points in a line we calculate 

• for either a precorrection or a postcorrection step. 

• for £ = 2, • • • , m* - 1. 

• for either a precorrection or a postcorrection step. 

The second item we need to know is how many points to unroll in the loops 
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Figure 9: Five point discretization, point relaxation 
• Update 

o Can compute residual 
@ Can compute projection 




Figure 10: Nine point discretization, point relaxation 

in the Update and UpdateRedBlack code. 

Three steps occur during the pre-correction step: smoothing, residual 
calculation, and projection. Two sets of computational subdomains fl U are 
necessary. One is for when just the relaxation method is used as a smoother 
and the other is when all three components are used at once. 

There is a scheduling issue when implementing cache aware multigrid 
algorithms. Figiires lui I ll gra.nhicallv show when the residual can be computed 
based on the last update in the relaxation method. Where a projection can 
be centered is also shown based on which residuals have been computed. 
Figure O shows when interpolation to the next finer level can be scheduled. 
What is expressed is simply a “compute as soon as you can” principal. 

For simplicity, point relaxation methods should probably be written assu- 
ming a nine point operator (see Figure CH). In order to write highly efficient 
code with no special cases, two extra “ghost” rows and columns are needed in 

• Update 

o Can compute residual 
(•) Can compute projection 




























Figure 11: Five or nine point discretization, line relaxation 
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• Update 

X Can compute interpolants on finer 
grid 



Figure 12: Five or nine point discretization, interpolation 



the computational grids. Values there are set to zero and no relaxation upda- 
tes occur (requiring a minor post processing). While for tiny grids this adds 
a significant amount of storage, we are only interested in problems that are 
much larger than would fit in the cache. Hence, the padding on the coarser 
grids only amounts to a trivial increase in the overall storage requirements 
(and is useful for parallel computing versions of the algorithms). 

Two steps occur in the post-correction step: smoothing and interpolation. 
In order for this half of a correction scheme to be optimally cache aware, the 
last step must use a somewhat smaller than the rest of the iterations. 

This is because the interpolant on level i -I- 1 is added to much larger vector, 
which is typically four times the size of the vector on level i. 



4 Numerical Results and Conclusions 

In 1^, and jZ] a collection of problems are solved on structured grids (2D 
and 3D) and unstructured grids (in 2D). Speedups range from 100% to 300% 
over using standard, well coded implementations. 

Reducing the number of times data passes through cache to the absolute 
minimum eliminates one of the major areas where multigrid does not take 
full advantage of the hardware that it is implemented on. The algorithms in 
this paper show how to reach this minimum. 

When using cache aware implementations, there are two issues that must 
be faced: is true portability wanted and how much performance is demanded? 
By true portability, all that must be determined for a given machine are two 
numbers: the number of elements in a cache line and the number of cache 
lines that can be guaranteed to be usable. 

If ultra high performance is demanded, true portability becomes much 
harder. Loop unrolling, relaxation updates along diagonals, specialty BLAS 
for short vectors (in machine language), and other machine specific opti- 
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mizations are necessary. One interesting aspect is that most RISC based 
processors are very similar. Hence, the non-machine language optimizations 
will work well on similar processors. 

Using cache aware multigrid algorithms requires a different programming 
style than is traditional (see 0 and [51 ) • In order to make the codes viable, a 
rigid programming style must be maintained for all components of the code. 
This includes a uniform subscripting method for variables, an isolation of 
the minimal number of lines of code that is necessary for a give discretized 
problem to do major components (e.g., updating a point by the relaxation 
method), and a decision on how portable the code will be. 

If only true portability is demanded, a general code can be constructed 
quite easily for 5-9 point operators Ai (see 0 for example codes). For the 
examples in only four quite small pieces of code have to be written as 
separate “include” files or macro definitions. Only the interpolation file will 
typically have more than one target point to modify. 

The ideas in this paper can be applied not just to natural and red-black 
ordered Gauss-Seidel, but to SOR variants, line relaxation methods, and ADI. 
Applying the ideas to typical parallel smoothers is also a straight forward 
extension. 
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A Locally Conservative Eulerian-Lagrangian 
Method for Flow in a Porous Medium 
of a Mixture of Two Components 
Having Different Densities 

Jim Douglas, Jr. Felipe Pereira Li-Ming Yeh 



Abstract 

The object of this paper is to develop an efficient, conservative, 
Eulerian-Lagrangian numerical method for the differential system de- 
scribing miscible displacement of one incompressible fluid by another 
of different density in a porous medium. The method will be a variant 
of the “Locally Conservative Eulerian-Lagrangian Method” that has 
been studied for immiscible displacement. 



KEYWORDS: LCELM, MMOC, MMOCAA 

1 Introduction 

There have been many discussions of numerical methods for the simulation 
of miscible displacement in porous media over the past two decades; see, e. 5 ., 
0iiniiTS]iTiiTiiniiiiE3i2HiEnii2iiniiniEzi However, almost all of 
this work has been related to a model for miscible flow that is consistent with 
the conservation of momentum only if the two components in a binary mix- 
ture have equal densities m- The object here is to treat the more realistic 
and complex system m that properly describes the flow of a binary mixture 
when the components are allowed to have significantly different densities; in 
particular, the numerical approximation of this system will be addressed by 
an efficient and locally conservative Eulerian-Lagrangian method {LCELM) 
that takes into account the dominance of transport over diffusion in the phy- 
sics. Our method is based on an LCELM treatment of immiscible displace- 
ment US; a somewhat similar locally conservative method was analyzed by 
Arbogast and Wheeler P| for linear problems; they and Chilikapati |2| had 
applied their method to the older miscible displacement model, though not 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 138-^^^ 2000. 

(c) Springer-Verlag Berlin Heidelberg 2000 
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without some difficulties. It is clear from the title of 0 that their derivation 
of their method was based on different concepts from those underlying our 
derivation. 

The authors consider the LCELM in m and in this paper to be a natural 
development in the family of modified method of characteristics (MMOC) 
procedures originally introduced by Douglas and Russell and applied 
first by Russell Ei E3 and then by many others to (equal density) mis- 
cible displacement and to immiscible displacement P3l im ESI 123 EHl Eni- 
The MMOC has several, by now well-known, advantages (including, in par- 
ticular, computational efficiency) and one fundamental flaw, which in the 
miscible displacement problem is the failure to preserve as an algebraic iden- 
tity the total masses of the two components, an important physical require- 
ment. Recently, for the immiscible displacement problem, Douglas, Furtado, 
and Pereira m formulated a variant of the MMOC called the modified 
method of charaeteristics with adjusted advection {MMOCAA) which does 
conserve the component masses globally and also preserves the conceptual 
and computational advantages of the MMOC] however, it does not neces- 
sarily conserve the component masses locally. There is a collection of finite 
element methods, called generically ELLAAI, that can address conservation 
problems locally, though not all implementations of these methods do exhi- 
bit local conservation; see EJ |H 0 0 1^ ■ We believe that ELL AM methods 
have not been applied to the system for miscible flow with differing densities. 
The cost of ELLAM procedures appears to be significantly greater than for 
the MMOCAA procedure or the LCELM technique to be described in this 
paper. 

As mentioned above, an LCELM procedure has been described in detail 
by the authors in m for approximating the solution of immiscible displa- 
cement in porous media; the experimental evidence reported in m clearly 
indicates the superiority of the LCELM over either the MMOC or the 
MMOCAA. A convergence proof for a simpler application of the LCELM 
concept has been obtained by Douglas and Huang m- There is a fundamen- 
tal distinction between the various MMOC procedures and LCELM tech- 
niques. All MMOC schemes relate to approximating transport-dominated 
parabolic problems beginning from nondivergence forms of the equations; 
LCELAI schemes are derived from the divergence forms of the equations. It 
is the use of the divergence form of a parabolic equation that allows relatively 
easy localization of desired conservation principles in a form amenable to the 
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application of finite element or finite difference methods. Both MMOC and 
LCELM procedures are properly viewed as operator-splitting techniques, 
with the transport being separated from the diffusion in the parabolic equa- 
tion. The two families of methods treat the diffusive part of a time step in 
like manner; the difference lies in the treatment of the transport. In MMOC 
procedures, certain characteristics are traced back from the advanced time 
level to the previous level; normally, these characteristics are associated with 
node points for finite differences or quadrature points for finite elements. 
The trace-backs arising in LCELM procedures lead to predecessor sets, not 
predecessor points. In the immiscible displacement problem, the integral 
curves that generate the predecessor sets are not the characteristics of the 
first-order transport operator coming from the nondivergence form of the sa- 
turation equation; for the miscible displacement problem the relevant integral 
curves do coincide with characteristics of the first-order transport operator, 
though they are usually associated with points that would not be employed 
in MMOC schemes. 



2 Unequal Density Miscible Displacement 

A model m for unequal-density miscible displacement can be described as 
follows. The model assumes that the components are incompressible and that 
there is no change in volume resulting from the mixing of the components; 
i.e., the volume of the mixture is the sum of the partial volumes. Let and 
Pfj denote the densities of the and pure components in the binary 
mixture and assume that > pp. Set 

O' = {pa- Pp)/ Pp- 

Let c = Ca = 1 — cp] then the density of the mixture is given by 

P = P(c) = pp{l + crc). 

Denote the pressure by p and the Darcy velocity by u. The pressure equation 
is given by the equations 

V • -I- ^--j- — D(u)Vc^ = q, X G fl, 

-^k{x) {Vp- p{c)gVd), 



u = — 



X gLI, 



( 2 . 1 ) 
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where D{u) is the dispersion tensor given by 

D{u) = (j){x){dmolI + \u\{dlongE{u) +d trans E^{u))), (2.2) 

4>ix) is the porosity of the medium, dmoi is the (small) molecular diffusion co- 
efficient, diong and dtrans are the longitudinal and transverse dispersion coeffi- 
cients, E{u) is the projection along the Darcy velocity, and E^(u) = I — E{u) 
is its orthogonal compliment. Also, q is the total external flow rate, with q > 0 
denoting injection and q < 0 production; k{x) is the permeability tensor; /r(c) 
is the viscosity, frequently quite strongly dependent on the concentration; g 
is the gravitational constant and, finally, d{x) is the depth function in the 
reservoir LI. For simplicity, a “no physical transport” boundary condition 

u - V = 0, X G dLl, (2.3) 



will be assumed; then, compatibility to incompressibility requires that 

/ q{x,t)dx = 0, Vt. 

Jn 

The concentration equation, in divergence form, is given by 

1 



dc 



D{u)L7c ] = cq, X G LI, 



1 + ac 

where c is the concentration in the fluid in the external source; 



c{x, t) = 



c{x,t), a specified function, if g > 0, 
c{x,t), the resident concentration, if g < 0. 



(2.4) 



(2.5) 



A “no diffusive transport” boundary condition will be assumed for the con- 
centration, and an initial concentration must be specified: 



(DVc) ■ V = 0, X G dLl, 



( 2 . 6 ) 



and 

c{x,0) = Cimt(x), xGLI. (2.7) 

The model above reduces to the older model if tr = 0; i.e., when the two 
components have equal densities. 

Recall m that the Darcy fiux u is a mass-averaged fiux, not the volu- 
metric flow rate. It is very convenient to introduce the volumetric flow rate; 
let 

a 

<JC 



w = u + 



D{u)Vc. 



( 2 . 8 ) 
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Note that it follows from the two boundary conditions (2.3) and (2.6) that 

w ■ = 0, X £ dfl. (2.9) 



A small amount of manipulation shows that the concentration equation can 
be rewritten in the form 



4>c 



cw 



- Vx ■ {DVxc) = cq = cq^ - cq . 



(2.10) 



It is also convenient to write (2.10) in mixed form by introducing a concen- 
tration flux; let 

z = -D{u)VxC. (2.11) 



Then, the concentration equation takes the mixed form 




-I- Vx ■ z = cq~^ — cq 



X £ fl, 
X £ fl. 



( 2 . 12 ) 



If the pressure equation is written in terms of w and p and the concentra- 
tion flux z is used to replace HVc in the equation, then the pressure equation 
can be seen to be given, also in mixed form, by 

fi(c)k(x)~^w + VxP = p(c)gVxd — ^ k(x)~^z, x £ id, 

1 + ac 

Vx - w = q, X £it, (2.13) 

u = w -\ z, x£fl. 

1 + ac 

The boundary conditions are expressed by the relations 

z ■ V = w ■ V = Q, X £ dil. (2-14) 



The introduction of the volumetric flux w will allow the system to be 
treated in a manner that closely resembles the way the older model has been 
approximated; in particular, we shall be able to use essentially the same 
operator splitting for the new system as for the old. If w had not been 
introduced, it is not clear that the pressure and concentration equations 
could be effectively separated as they are in the procedure described below. 
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3 The Basic Operator Splitting Algorithm 

Assume that we shall approximate both {w,p} and {z, c} in the lowest index 
Raviart-Thomas mixed finite element space TZT = V x W m over the same 
partition {flj}. (There is no logical constraint that the partitions for the 
pressure and concentration be the same; in fact, we shall introduce a local 
refinement in the pressure partition in order to obtain a better approximation 
of a point source or sink. There can be other advantages to the use of different 
partitions for the two equations; see HnUnilTHI.) Let 

Atp = iiAtc = iii2Atct, (3.1) 

where ii and are positive integers; in their numerical experiments the 
authors have frequently chosen ii to be one or two and ^2 to be about ten. 
Set 

t = mAtp^ tn = TiAtc^ tji f^ = tji T nAtct^ 

where normally 0 < k < Z 2 - The pressure equation will be approximated 
at time levels t"*; the concentration will be approximated through transport 
microsteps corresponding to times and diffusive steps corresponding to 
times tn- We shall indicate a function / evaluated at these times by /"* = 
/(f™), /„ = /(t„), and fn,K. = all functions will be assumed single- 

valued {i.e., if = tn, then = /„). 

Define an extrapolation operator as follows: 

f f, 

Ef{t)=l f-t— 1 t-t^ , 

I -j-m ^m—1 •' 1 ’ 

Then, the algorithm can be described as follows. 

1° Given = Cinit{x), find {W°, C/°}, where {W°,P^} £ TZT and 

If Cinit vanishes, so that the simulation begins when the injection of the 
solvent commences, then = 0. If Cmit is not identically zero, then 
the initial pressure problem is nonlinear, since = —D{U^)VxC^ is 
unknown. A suggestion for working the nonlinear elliptic problem is to 
apply the method of continuity to it in the following form. First, set 
(T = 0, so that the elliptic problem is linear (and corresponds to the old 



0 < t 



144 



Douglas, Pereira, and Yeh 



miscible model). Solve for a first approximation 17°’° to 17°: 

(/r(C°)7-ify°’°, 7 ) - (V. • 7 , P°’°) = {p{C°)gV.d, 7 ) , 7 e V, 

(V,-W^°’°,ry) = (g,77), 77 G W, 

J 70.0 _ ^rO,o_ 

(3.2) 

Now, let cr"* = mSa, where = a. Then, for m = 1, . . . , M, set 
2 - 0 , m-i ^ -D{U°’^-^)y^C° and solve the linear problem 

{p{C°)k~^W°’'^, 7 ) - (V,, • 7 , P°’™) 

= (p(C°)5V,d,7) - , 7 e V, (3-3) 

(V,-lT°’™,77) = (g,?7), V€W, 

for { 1 T°’'", P°’™} and set 

TiO.m TTrO.m I ^ ryO.m — l, 

77 °, M g] 2 ould give a very good approximation of C/°, which should then 
be obtainable in a small number of Newton iterations starting from 

Z0,M ^ 

If C° is constant, then {TT°,P°} G TZT satisfies (3.2). 

2° Assume {17"*, Vb"*, P™, Z™, C™} known for some m > 0. Then, carry 
out a full pressure time step employing the operator-splitting procedure 
outlined in 3° — 5° below. 



3° 



Let t"* < Split the concentration time step (tn;ln+i) into 

calculations for transport and diffusion. Associate transport with the 
system 



' t.X 



(j)C 

cw 



= C(7+ 



cq 



X G ft, 



(3.4) 



w ■ ly = 0, X G did, 

and apply the LCELM transport microstepping procedure (to be de- 
tailed below in over the microsteps (tn,K.-i,tn,K), k = 1 , . . . , 72, so 
that tn,i 2 = tra+i, to obtain the transport-approximation (7„+i G W. 



4° Given Cn+i, complete the concentration time step by approximating 
the solution of the diffusive system 



At - V,, • (P((Pf/)„+i)V.a+i) = 0, xGil, 
(p((ac/)I+i)v,c„+i)-7/ = o, xGdn -, 
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i.e., find {Zn+i,Cn+i} G TZT such that 

7) - (V, • 7, c„+i) = 0, 7 e V, 




(3.6) 



Ift„+1 set = {Z„+i,C„+i}. 

5° Given {Z’"+\ C""+H, find p™+i} g nT such that 



{gi{C'^+^)k-^W^+^ ,y) - (V^ -7,P™+i) 




7e V, 

r] gW, 



{V^-W^+\r,) = iq,r,), 



(3.7) 



and set 



jjm+l ^ ]^m+l _|_ 



^ ym+1 

1 + aC™+i 



(3.8) 



6° Repeat pressure time steps to the desired final time. 

The partial differential system is nonlinear, since the concentration ent- 
ers the pressure equation nonlinearly through the viscosity and the term 
including the concentration ffux, which in turn involves the dispersion tensor 
D(U). The dispersion tensor’s appearance in the concentration equation is 
another nonlinearity in the system. However, the operator-splitting proce- 
dure outlined above produces a numerical method that is linear in every part, 
with the exception that a nontrivial initial concentration can induce a single 
nonlinear elliptic problem for the initial pressure distribution. 

4 Discretization of the Pressure Equation 

The LCELM procedure to be employed herein is based on that introduced 
by the authors m to treat immiscible displacement in porous media. It was 
assumed above that the solutions of the pressure and concentration equations 
are being approximated in Raviart-Thomas spaces TZE of lowest index over 
the same rectangular partition (in two or three spatial dimensions). Conse- 
quently, the pressure and the concentration variables are approximated by 
piecewise constant functions, while each component of the fluxes W and Z is 
approximated on each element by a function that is linear in the correspon- 
ding component of the space variable and is constant in the other components 
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of the space variable. In the model for miscible displacement being conside- 
red in this paper, the Darcy velocity u plays a secondary role to the flux 
w, in that w is the natural choice for the flux in the pressure equation and 
is the velocity variable appearing in the transport part of the concentration 
equation. If the external flow is restricted to a few elements (as is usual 
in practice), then in almost all of the domain w will be approximated in a 
discrete divergence-free manner by the mixed method flux variable W. In 
the LCELM procedure that will be used to approximate transport in the 
concentration equation, it will be necessary to evaluate W (or, more preci- 
sely, EW) at the vertices of the elements; however, W, as produced by (3.7), 
is discontinuous at vertices, and some averaging or interpolation procedure 
must be applied at such points. It was observed in m that it is strongly 
advisable to devise an evaluation at vertices that preserves the divergence- 
free nature of the approximation in the neighborhood of these points. The 
method used in m and that will be repeated here is to define a bilinear or 
trilinear interpolation W of element-center values of W. It can be seen by 
a short calculation that, if g = 0 on all of the elements having a point x as 
vertex, W is divergence free on boxes centered at x and contained in these 
elements. The extrapolation of W remains divergence free in this sense and 
will be employed in the transport calculation. 

An alternative to the interpolation would be to offset the pressure par- 
tition from that for the concentration. If the element centers for the con- 
centration partition are taken as the vertices of the (rectangular) elements 
in the pressure partition, then the flux W would be continuous at the verti- 
ces of the concentration partition and W would be discrete divergence free 
about vertices in pressure elements where <7 = 0. On the other hand, D(U) 
would be discontinuous in concentration elements, which would complicate 
the diffusive part of a concentration time step. 



Let us derive the local conservation law that will provide the basis for our 
LCELM approach to the miscible problem. Recall the transport equation 



5 Transport Microsteps 
by an LCELM Technique 




(5.1) 



w ■ V = Q, 



X S dLt, 
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Following 23], we consider the space-time slice Q = x [tn,K.,tn,K+i]- Let 
/C be a reasonably shaped, simply-connected subset of fl, and define a subset 
'D — T>n,K{JC) of Q as follows. For each x S 5/C, construct the solution y{x', t) 
of the final value problem 

dy cw w 

dt (j)c (j)’’ 

y{x;tn,K+i) = X, 

and set 

Xn,i^ix) = y{x;t„^f,). (5.3) 

Then, let AC = AC„_„ be the interior of the set {xn,K{x) : x € 5/C}, and let 
T> be the tube determined by AC, AC, and the integral curves (5.2). (For Atct 
sufficiently small, the map x — >■ Xn,K is one-to-one, so that this construction 
can be carried out.) Now, denote the outward normal to dV by d{x,t) and 
note that it is orthogonal to the vector {(j)c,cwY on the lateral surface of T>. 
Then, integrate (5.1) over T>: 




= /j, (eg’*' — cq ) dx dt. 



tn.K+l t t. 






(5.2) 



Thus, mass is conserved locally in the transport step if 






•n,K+l 



dx 




cq ) dx dt. 



(5.5) 



The no-fiow boundary condition is handled in a natural way in (5.4), since 
the integral curves (5.2) do not exit Lt in this case. In fact, if a; G dLt, then 
the integral curve remains in dLl and T> has a portion of its lateral surface 
contained in dfl x tn,K+i]- Hence, no special cases arise for subsets AC 
close to the boundary for these boundary conditions. 

Before proceeding to the discretization of (5.5), let us discuss the relations 
between (5.5) and the corresponding conservation law for immiscible displa- 
cement and that employed in 2) for the older miscible displacement model. 
It is easy to see that the nondivergence form of the concentration equation 
is given by 

(j)— + w ■ VxC - Vx ■ {D{u)Vxc) = (c - c)g+. 



(5.6) 
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Thus, the integral curves determined by (5.2) coincide with the characteristics 
of the first order part of (5.6), whereas in the immiscible problem treated in 
m they do not. In the miscible displacement model treated by Arbogast, 
Chilipataki, and Wheeler | 2 | and in the paper by Arbogast and Wheeler 
|E] on linear convection-diffusion equations, our volumetric flow rate w is 
replaced by the Darcy velocity u, which is the volumetric flow rate of the 
incompressible mixture when ct = 0. Thus, the tubes T> represent, both for 
them and for us, actual flow tubes, with the consequence that the volume of 
K. must equal that of /C in the absence of external flow. This seems to have 
introduced some difficulties in their calculations, which we believe (on the 
basis of our experience with the immiscible displacement problem m and 
that reported in PJ for the older miscible model) to be related to the way 
they evaluated u at the vertices of their elements. The constraint itself does 
not arise in the immiscible problem. 

Now, let us turn to the discretization of the conservation law (5.5). It 
was seen in m that an acceptable choice for choosing the sets {K} for 
conservation is to use exactly the concentration partition, and we shall make 
this choice here. So, let K = idj for some j. We shall define K as the 
box obtained as follows. First, approximate the integral curves (5.2) at the 
vertices of K. Since the microstep At^ is usually small (he., = 10 or so) 

with respect to At^ we shall assume that it suffices to follow the tangent to 
the integral curve through each vertex of K back to the time level using 
{EW)n, K+i as defined above. Thus, if x is a vertex of K, then its predecessor 
point x„ „(x) is given by 

x„,k(x) = X - (£’W)„,„+i(x)Afct/ (/'(x). (5.7) 

If n is two-dimensional, then for sufficiently small Aid, the four points 
x„ k(K) form the vertices of a quadrilateral, K = which intersects 

at most nine elements. (In theory, we could take steps long enough that the 
predecessor set would intersect elements other than the immediate neighbors 
of flj, but it is our intention to treat very inhomogeneous media, for which 
the longer microsteps would lead to rather inaccurate simulations, at least 
if K is defined using only the vertices of K.) If lies in three-space, the 
eight points x„^k determine a box with ruled, almost quadrilateral surfaces 
for faces. The common face of two adjacent elements fli and flj is mapped 
to a common face between the corresponding predecessor sets. Denote the 
tube generated by K and K by D — D„_„(K). 
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Next, we must discretize the conservation relation (5.5) as nearly exactly 
as we can. First, given the uncertainties of real reservoir data, it suffices 
to assume that the porosity (f) is constant on each element of the partition 
{flj}. Then, the integral over K can be computed exactly. If Ll is two- 
dimensional, an algorithm for computing the integral over K was given in 
§10 of j21]. The argument for approximating the integral over the tube D 
by a trapezoidal rule in time given in §7 of [21 is independent of dimension. 
The remaining question is how to approximate almost exactly the integral 
over K in the three-dimensional case. Perhaps the simplest suggestion is to 
compute the coordinates of the image of the center of each face of K under 
the map specified above and to approximate the image of a face of K by the 
four triangles determined by the images of the vertices and center of the face. 
Then, an algorithm generalizing that of §10 in m in the two-dimensional 
case can be written down and applied. Now, assuming that each integral in 
(5.5) can be evaluated exactly or to high precision, let us replace (5.5) by the 
approximation 



^C dx — dx -t- 



AC 



+ - 



K ' 

AC 



>K 



(Cn^K+lQ~^ Cn,K+lQ ) dx 



2 JK„ 



) dx ^ K 0, . . . , ?2 f ; 



where 



^n,0 — X € n. 



Let |K| denote the volume of K. Then, Cn^n+i = C'j,n,K+i = C'n,K+i(%) 
satisfies the local, linear equation 



)K| 



Atr 



2 

H — f ^Cn^K,Q Cn^nQ ^ dx, K — 0, . . . , ?2 1- 

K.^ ^ 



K 



Q dx j Cji^i^-\-i — dx -t- 



Atr 



/K 



Cn,K+ig+ dx 



(5.8) 



The transport part of the concentration time step is completed by setting 



C. 



n-t-l 






X £ ft. 



(5.9) 



The diffusive part of the concentration time step has been adequately 
defined by equations (3.6) above. Thus, the complete concentration time step 
has been specified. And, since the pressure time step has been determined 
by (3.7) and (3.8), the entire computational algorithm has been specified. 
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Figure 1: Comparison of LCELM simulations of a homogeneous reservoir using 
128, 256, and 512 elements at 50, 250, 450, 650 days. Refinement produces a 
sharper front. 



6 Numerical Experiments 



Viscosity 


gLa = -5 cP 


Density 


Pa = 1 5 / cm 3 


Porosity 


(j) = .2 


Absolute permeability 


k = 2 mdarcy 


Viscosity function 


m(c) = P/3 ("1 + 



W^O.25 



gLfi = 10 cP 
P/3 = -7 g! cm^ 




-4 



Here, we present numerical results to show convergence of the LCELM 
method. Assume the reservoir to be 256 m x 256 m. Inject fluid (with c” = 1) 
uniformly along the left edge of the reservoir and let the (total) production 
rate be uniform along the right edge; no flow is allowed along the top and 
bottom edges of the reservoir as they appear in the graphics. Gravity will 
be neglected. The injection rate is taken to be one pore-volume every five 
years, and data below are held flxed for the computational results discussed 
here. 
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Figure Q shows a mesh refinement study for a homogeneous reservoir (cor- 
responding to a one-dimensional problem). Three different discretizations, 
with h = hx = 2 m, Im, and .5 m, were used for the LCELM simulation. 
The concentration c obtained for the three grid sizes are almost the same, 
indicating convergence for the concentration. Figure El is a mesh refinement 
study for a heterogeneous reservoir at 200 and 300 days; the permeability for 
this test was defined on a 64 x 64 grid. In this case, three different discre- 
tizations (64 X 64, 128 X 128, 256 x 256 grids) were again used. It is observed 
that flow patterns from the three different grids are almost the same; if the 
mesh size is smaller, the concentration front is sharper. 
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Abstract 

We describe discrete well models for 2-D non-Darcy fluid flow in 
anisotropic porous media. Attention is mostly paid to the well models 
and simplified calibration procedures for the control volume mixed fi- 
nite element methods, including the case of highly distorted grids. 

KEYWORDS: well models, non-Darcy flows, anisotropy, distorted grids 

1 Introduction 

Flow around high production rate gas wells deviates from Darcy’s law. This 
phenomenon has been successfully modeled by the two-term Forchheimer 

law |7]. 

In reservoir simulation, the discrete well model is a relation between the 
production/injection rate of the well, the well-block pressure and the bottom- 
hole pressure. This relation is specific to the basic approximation scheme 
used for discretizing the governing equations. Such well models are well un- 
derstood in the case of Darcy flow and are mostly based on various generaliza- 
tions of the effective radius concept pum. In the case of Forchheimer flow in 
isotropic media, we And the effective radius as a function of the dimensionless 
Forchheimer number using the invariant behavior of discrete solutions near 
the well blocks. Such invariant properties are analyzed in j0| and are assessed 
numerically by solving a set of auxiliary problems which reproduce the known 
analytical solutions around a single isolated well in an infinite domain. Our 
numerical experiments show that this calibration procedure is very accurate 
and can also be applied on non-uniform and highly distorted grids. 

In the general case of anisotropic media, there is no consensus on a spe- 
cific formulation of Forchheimer’s law that is backed by experiments or from 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 156-^^^ 2000. 
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first principles. In this work, we derive well models only for the simplest for- 
mulation Pj, which provides reasonably good fit to pore network simulation 
results such as those obtained by Thauvin and Mohanty Ca- 
in this case direct numerical simulation becomes a critical tool for esti- 
mating the validity of a specific well model. Such verification requires much 
higher accuracy compared to conventional reservoir simulation techniques 
and imposes strict requirements on the quality of the approximation scheme. 
To this end we have developed a modification of the control volume mixed fi- 
nite element (CVMFE) scheme 0 based on quadrilateral grids in 2-D and on 
hexahedral grids in 3-D. This scheme is conservative, nearly optimal among 
second order schemes, and naturally incorporates harmonic averaging of re- 
servoir properties. Moreover it allows us to obtain reliable results on highly 
distorted grids and even on grids that are “degenerate” in the conventional 
finite element sense (for example on grids with non-convex cells in the plane.) 

Numerical experiments in the 2-D case show that very fine grids are ne- 
cessary in order to obtain grid independent results using direct simulation. 
Such grids may be impractical in the 3-D case, especially in the case of devia- 
ted wells. Hence there is a need for high order methods suitable for accurate 
resolution of the flow in the near-well region using coarse grids. 



2 Problem Formulation 
and Governing Equations 

The governing equations that describe steady-state, single-component, single 
phase, isothermal flow in porous media are 

V-(pu) = /, fiK~^u + p/3\u\u + Vp = 0, (2.1) 

where p denotes the fluid density, u the velocity vector, p, the dynamic visco- 
sity and p the pressure. The porous medium is characterized by the permea- 
bility tensor K, the porosity cj) and the Forchheimer coefficient j3, which can 
be a tensor in some formulations. The right hand side / is associated with 
the presence of wells-localized mass inflows or outflows in the reservoir. 

Different Forchheimer law formulations are available for the anisotropic 
case. Thauvin and Mohanty, m, require that (3 = {bij} in order to fit their 
network simulation results. Knupp and Lage |0| have suggested an anisotropic 
formulation using a variational approach. Their formulation can be written 



158 



Garanzha, Konshin, Lyons, Papavassiliou, and Qin 



as follows 

_i (u • K~^u) s 

pK ^u + /3p 3 - if ^u + Vp=0, n = 2,3, (3 is scalar, (2.2) 

(det if-i) 27 T 

where n = 2,3 is the space dimension. This model has fewer degrees of 
freedom as compared to (2.1). Nevertheless, it provides a good fit to the 
data from M- However, these data are still not enough in order to choose a 
particular tensorial model. 



3 CVMFE on Distorted Quadrilateral Grids 



In order to describe the discrete approximation to system (2.1) on quadrilate- 
ral grids for each element we introduce the mapping r = r(^i, ^2) which maps 
the unit square on the quadrilateral in physical coordinates r = (a:i, ^2)^ and 
associated metric entities 

f)j* ■ 

gi = S = {sy}, = ^, J = det S, gjg* = J5^j, 



where gi , g® are the covariant and scaled contravariant basis vectors and J is 
the Jacobian of the mapping. Using above notations the governing equations 
can be written as follows: 



d 






i=i 



(det if“i) 2 a 



(3.1) 

(3.2) 



U® = u^g®, u = -- ^giU® (Piola mapping). 



i=l 




Figure E Support for the edge-centered flux base (left) and for the flux test 

function (right). 
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The Control Volume Mixed Finite Element methods (CVMFE) as intro- 
duced in ^ are based on the lowest order Raviart- Thomas (RTq) flux basis 
functions and cell-based pressures. Equation (3.1) is integrated over the grid 
cell in the parametric space with mid-point quadrature rules for the contour 
integrals. The Forchheimer law (3.2) is integrated over the edge-centered con- 
trol volumes in the parametric space and mid-point rules are used for pressure 
contour integrals. This derivation is equivalent to using the Raviart-Thomas 
flux-pressure bases, piecewise-constant pressure test functions and flux test 
functions defined by (3.3), which is illustrated on Fig. [Dfor the cases when 
the edge e locally coincides with the vector g 2 . 






I 



(l-COjgi ™ 
Cijgi in 
0 



Cright(e) 

Cleft (e) 
elsewhere 



i’e = 



jgi in Ceight(e) 
jgi in Cleft (e) 

0 elsewhere 

(3.3) 



MFE 



CVMFE 



1/4 1/8 




Figure O Equivalent first order quadrature rules and distorted grid cells. 



In 0 the value j is in fact approximated by a constant in each half-cell 
which allows the integrals over cells to be computed exactly. The basic advan- 
tage of CVMFE is the observed 0{h?) convergence 0 in terms of pressure 
and fluxes on highly nonuniform grids and in the presence of strong coeffi- 
cient jumps. Moreover it possesses optimal spectral resolution properties in 
a whole range of the discrete harmonics and provides accurate solutions on 
mildly distorted grids. The main drawbacks of CVMFE are the lack of the 
Linear Preservation (LP) property and nonsymmetric discrete metric tensor. 

It is possible to derive CVMFE alternatively as a “low order” approxi- 
mation to conventional mixed finite element (MFE) method Thorough 
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analysis of relations between finite volume and finite element methods with 
different quadrature rules can be found in | 2 | where it was shown that most 
FE methods in primal and dual formulations can be written in the “factori- 
zed” form, or as a flux differences in terms of finite volume methods. 

In |S| it was shown that there exist quadrature rules for the MFE integrals 
with first order algebraic accuracy which result in partial error cancellation 
property, namely they result in the discrete system which is identical to that 
resulting from RTq bases and flux test functions defined by (3.3) with certain 
first order quadrature rules, which are shown on Fig. El 

The resulting discrete system coincides with CVMFE E| on grids with 
affine cells, the discrete metric tensor is symmetric positive definite on ad- 
missible cells. Moreover the scheme is Linearity Preserving and is more ac- 
curate than the original CVMFE on distorted grids. The set of admissible 
grids is wider for this scheme as compared to conventional finite elements in 
the following sense: the necessary condition for the convergence of discrete 
solutions in the case /3 = 0 in fully discrete norms is that the Jacobian of the 
local mapping in the cell edge centers is bounded from below by a positive 
constant (see Fig. The convergence proof is similar to that in m and the 
invertibility of local mapping for each element is not required, i.e., the cell 
shown on Fig. 0 (center), is admissible and the local discrete metric tensor is 
positive definite and has condition number of the order of unity in this case. 
An example of degenerate cell is shown on Fig. Q (right). 

Similar conclusions are valid in the primal formulation, i.e., for the bilinear 
finite element method and control volume finite element method. 

4 Analytical Estimates for the Equivalent Well- 
Block Radius tq in Isotropic Darcy Flow 

In the case of infinite uniform grid with square cells and isotropic Darcy flow 
the generic dimensionless discrete system which comprises several well known 
approximation schemes can be written as follows: 

— (2 -|- 2w)Pij + w{Pi-ij + Pi+lj + Pij-l + Pij + l) 

+ + Pi_|_i j_i -|- Pi_i j_|_i -|- Pi+i j+i) = 

(4.1) 

sQij “t“ (1 s) “t" Qij—1 ~t“ Qij + 1^ 

+ ^{Qi-lj-1 + Qi+lj-1 + Qi-lj+1 + Qi+lj+l)) ; 

where i,j are the grid node (cell) indices. 
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We seek the solution to (4.1) in the infinite computational domain 
— oo < i,j < +00 with the following right hand side Q 

Qoo = h Q^j = 0, (4.2) 

The problem (4.1), (4. 2) is closed with the following condition 

Poo = 0: 1™ = 0; f (4.3) 

r— >-oo T 

where Ax is the side of the square grid cell. 

The solution to problem (4.1)-(4.3) exists and is unique [Q. Similar to [Q 
our objective is to find the value tq such that the following equality is valid 

lim (Pij — ^ In — ) = 0, r = Ax \/ P + p. (4.4) 

r— >oo 27T r 

Equation (4.1) leads to some popular schemes, in particular the values s = 
1, w = 1 correspond to conventional finite difference scheme (ED), s = 0,w = 
I is the bilinear finite element (BEE) scheme, while s = 0,w = | correspond 
to the control volume finite element method (CVFE) and the control volume 
mixed finite element method (CVMFE). In the latter case equation (4.1) 
is deduced from the extended system by elimination of flux variables. In 
order to underline the difference between the single-cell production term and 
multiple-cell production term we include the schemes CVFE’ and BEE’ which 
are defined by the parameters s = l,w = ^ and s = l,rc = |, respectively. 

Andreev PQ derived the asymptotic expansion for the case s = 1. It can 
be generalized for the general case s yf 1 by adding a constant c, 

Pij = c -I- ;^(ln — - ^ln2-7-|-ilnr(;)-|-of — , r = Ax\f P + p, 
Ztt r Z Z \ ’’’ J 

where 7 = 0.57722156649 ... is the Euler constant. The value of c for several 
approximation schemes was computed in |H| using the analytical solutions 
from Q and superposition principle. Comparing the above equality with 
(4.4) we obtain that 

^0 _ „-7- § In 2 + i In w+2itc 

Ax 

A simple approximate solution approach for finding rp was suggested in m 
using the observation that discrete solution in the near well cells is close to 
the analytical solution. Omitting the derivation details we obtain 

Ax 

All results are summarized in Table ^ 
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Scheme 


exact value 


exact value 


Peaceman 


error % 


BFE 

BFE’ 

FD 

CVMFE 

CVFE’ 


^ 5%/3 ,^,377 

(2+\/3) 16 e ^+32 
2\/6 
e-"' 

2V6 

e-"' 

2\/2 

1 g|( 4 - 87 + 7 r) 

e-~< 

4 


0.313833 

0.114607 

0.198506 

0.3427305 

0.140365 


21/4 g-27ir/64 
21/4 g-3ir/4 

g-7r/2 
21/6 g-37i-/8 
21/6 g-27i-/3 


0.68 

1.65 

4.72 

0.83 

1.52 



Table n Equivalent radius for different numerical schemes and Darcy flow. 



5 Calibration Procedure Based 

on the Solution of Auxiliary Problem 



The equivalent radius does depend on the discrete representation of point 
sources/sinks. The most natural discrete approximation to the i5-function is 
by the piecewise-constant hat function, which is illustrated in Fig.0 Rigorous 
analysis and convergence proofs for such approximations as applied to the 
Darcy law case can be found in We write the contribution to the right 
hand side / in (2.1) from a single well as follows 

/= 



0(r) = 



(Ai+A2)^ 
4d^AiA2 ’ 
0 , 



r G fi/,(K) 
r i flhiK) 



Lth{K) = {r : • r| < 



dXi 



Ai + A; 



-}• 



Here rg is the well location, H is the height of the perforated zone (fully 
penetrating vertical wells are assumed) , Q is the mass rate of the well, , Ai 
are the unit eigenvector and eigenvalue of K in the vicinity of the well, 
respectively, or 



K = AKA^ , A = (ai,a 2 ), A = diag(Ai), A^A = /, 



while d is chosen such that the well block can be placed inside Llh{K), e.g., 
on square grid d — Ax. 

The reason for this choice of hat function is that it is non-zero in the 
square in the transformed coordinates 

r' = A~^A'^r. (5.1) 



This typically results in multiple-cell production terms for a single well and 
the estimates for rg differ from those in and HU since the superposition 
principle should be used for its computation in the linear case. 
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Figure 0 Discrete approximation to the production terms. 



The basic motivation for this model of discrete sources and sinks is that 
numerical solutions in the vicinity of the well are more accurate compared 
to single-cell production term (typically by factor 4 to 7 on the grid with 1:3 
aspect ratio shown on Fig. 01 

The well models in the case of isotropic Forchheimer flow are based on 
the analytical solution for radial flow around isolated well jS| 



P{r) =PR + 



M Q . 

kp2TrH \R 



PQ\Q\ .1 

p{2ttHY^R r’' 



(5.2) 



Using the assumption that the finite difference solution in the cells near the 
well block is close to the analytical solution (5.2), it was shown in 0 that 
the local behavior of the discrete solution is described by the dimensionless 
Forchheimer number of the well block defined as Fo = ■ The equivalent 

radius a = rg j5x was found as a solution to the nonlinear equation 

^(l + Uo)=ln(l) + -Fo(i-l), 

2 a TT a 

In jO] the well model was derived also for the bilinear finite element approxi- 
mations, or BFE' in our notations since single-cell production term was used. 
The result looks as follows 

_ , M Q 1 f \ , PQ\Q\ 1 ^ 

kp2TrH \aiAa;/ p{2ttH)‘^ a2Aa; ’ 

where po,Pw are the well block pressure and flowing well pressure, respec- 
tively, oi = 2*e^ (see the same value in Tabled, a 2 = ^ 

and F Ri 1.35 is the empirical calibration constant. The above results have 



164 



Garanzha, Konshin, Lyons, Papavassiliou, and Qin 



provided insight into the problem, however they cannot be used in the case 
of irregular and distorted grids. To this end we suggest to find tq via solution 
of small auxiliary system using the following procedure: 

(a) choose a window around the well block; (b) write 
discrete approximation to governing equations in this 
window; (c) specify Dirichlet boundary conditions 
using the analytical solution (5.2); (d) solve discrete 
system, find the pressure in the well block pq and 
find To via ^ 

Choosing the window size to be 3 x 3 cells results in accuracy which is 
comparable to that of the Peaceman method while using well block plus 2 
cells in each direction typically allows to obtain 3 to 4 correct digits in tq. 

An attractive feature of this simple calibration procedure is that it can 
be used in the case of anisotropic permeability using the following analytical 
solution to (2.1) and (2.2) 

(r’^AT“^r)! 

(det K~^)i 
(5.3) 

This analytical solution is derived from (5.2) using the transformation of 
space variables (5.1). In this case the well model can be written as follows 

_ , h Q ^ f fw\ , PQ\Q\ 1 ^ 

Pw — Po+ ^ 1 7y~F7 ^ — /o 

(det AT) 2 p 27riJ \ro J p[2ttHY tq 

where fyj is the mean well radius computed according to HU- 



p{f) =PR + 



Q 



(det AT) 2 p 27ri/ \R 



In ( 4 



!3Q\Q\ .1 1 , 

p{2ttHY^R 




Figure 0 



rw 



^cond(AT)4 +cond(AT)' 



^ / (z^A: ^z) \ 

27t J v(det AT“ 1)2 / 
0 



d(j), 



where z^ = (cos(^), sin(</))). Now the calibration procedure is modified as 
follows: (a) fix constants R and pr; (b) solve discretized equations in a “win- 
dow” around the well with Dirichlet BC specified by (5.3); (c) find pressure 
Po in the well block and find tq as a solution to nonlinear system (5.3) using 
the equality p(fo) = po- It is important that the physical properties for the 
calibration procedure should be the same as for the reservoir simulation, i.e., 
the Forchheimer number or its generalizations should be the same in both 



cases. 
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6 Numerical Experiments 



Typical well model validation scenario requires the following stages: a) deri- 
vation of the well model; b) numerical experiments with flow around isola- 
ted well; c) numerical simulation of 5-spot flow on Cartesian/distorted grids 
using well models; d) validation using direct simulation of the 5-spot flow on 
extremely refined radial grids near wells. Table El shows the comparison of 
computed data on 201 x 201 grid with analytical solutions which clearly shows 
that the multiple-cell production terms result in more accurate solutions. 



Scheme 


Wph-pWl^ 0/ 


\\Ph-p\\L^ 0/ 


IK -pile 0/ 


Ibikr 


WpWl. 


Ibllc 


BFE 


0.0049 


0.039 


0.41 


BFE’ 


0.0074 


0.164 


2.31 


FD 


0.0060 


0.079 


0.93 


CVFE’ 


0.0032 


0.054 


0.74 


CVMFE 


0.0026 


0.018 


0.23 



Table 0 Discrete norms of errors for different schemes. (201 x 201 cells) 



Injection rate 


0.05 mmsef/day 


Number of injection wells 


4 


Injection well coordinates (ft) 


(0, 0) (200, 0) (0, 200) (200, 200) 


Number of production wells 


1 


Production rate 


4x0.05 mmsef 


Production well coordinates (ft) 


(100, 100) 


Well radius 


0.35ft 


Reservoir dimensions 


200 ftx 200 ftx 1 ft 


Initial pressure, pi 


5000 psia 


Fluid density 


1.783926 xlO-i g/cm^ 


Fluid density 




at latm, 60°F 


6.76361 • 10-4 g/cm3 


Fluid viscosity 


2.5574794 x 10"2 cp 


kii,k22{mD) 


10,10 or 10,100 


k\2 = ^21 


0 


Forchheimer coefficient j3 [ft“^] 


0, 1.71 • 104°, 1.71 • 10“ 


Boundary conditions 


No-flow at all boundaries 



Table 0 Simulation conditions. 



Two different sample coarse grid configurations for the same 5-spot flow 
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are shown in Fig. 0 



WINDOW 





Figure 0 Different grids for 5-spot well configuration. 



All data for this problem are presented in Table 0 

The well model validation results in the isotropic case for the CVMFE 
scheme are presented in Tabled Production terms and well model are used 
on the 11 X 11 Cartesian grid while in direct simulation fluxes through well 
boundaries are specified. In this case the bottom-hole pressure is the 
well model quality indicator. Very fine grids were used for direct simulation, 
typical cell size near well was about 



p 


Vo -Pi 


To /Ax 


Pw -Pl 


Pw - Pi, 

direct 

simulation 


err % 


0 


-121.683 


0.342926 


-262.149 


-262.602 


0.17 


1.71 • IQio 


-132.067 


0.35065 


-490.176 


-490.694 


0.11 


1.71 • IQii 
1.71 • IQii 


-225.451 


0.37137 


-2542.4 
-2560 0 


-2543.5 


0.04 



Tables Comparison of simulation results. 



b 


P 


O 

1 


ro/Ax 


Pw - Pi 


Py, - Pi, di- 
rect 

simulation 


err % 


0 


1.71 • 10^1 


-387.46 


0.376743 


-2543.1 


-2543.5 


0.01 


0.43 


1.71 • lO^i 


-402.745 


0.358036 


-2543.1 


-2543.5 


0.01 



Table 0 Results for the distorted grid simulations. 



The numerical results illustrating the influence of the grid distortion on 
the accuracy of the calibration procedure in the isotropic case are presented 
in Table 0 Here 6Aa: is the value of the quasi-random displacement for the 
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grid nodes. The initial 23 x 23 Cartesian grid and the resulting distorted grid 
are shown on Fig. O 

Both grids along with discrete pressure and pressure errors are presented 
on Fig. 0 The pressure errors for the 5-spot flow are computed via compa- 
rison with very fine grid results. 




k, 




K5E+0C 



)74E.»Oa 

503E+OC 





Figure!^ Pressure contour maps and error maps for regular and distorted grid 

simulations. 

The pressure errors are only 3 times larger as compared to the results on 
square-cell grids. This is quite good, given that the distorted grid contains a 
lot of poorly shaped and non-convex cells. 

The well model validation in the anisotropic case is more difficult since 
the normal flux distribution through the well boundary is not known. Hence 
in direct simulation pn, from the well model is specified while the predicted Q 
becomes the quality measure. Another observation is that as a rule of thumb 
the window around the well in the calibration procedure should be much 
larger as compared to the isotropic case for the same well model accuracy. 
The preliminary results of simulation on the 47 x 47 grid with square cells 
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are presented in Table El 



/3 


ro/Ax 


Pw - Pi 


exact Q 


predicted Q 


err % 


1.71 • 10“ 


0.46 


-2072 


0.2 


0.20015 


0.076 



Table 0 Validation results for fcn = 10, ^22 = 100. 



7 Conclusions 

Asymptotic methods and the superposition principle allow the derivation 
of exact expressions for the equivalent radius tq for various approximation 
schemes on Cartesian grids. 

An inexpensive black-box calibration procedure allows rg to be computed 
in general grid configurations for non-Darcy flows, including anisotropic cases. 

The CVMFE approximation scheme provides optimal resolution for near- 
well flow, including the case of distorted grids. The derivation of CVMFE 
via low-order “cancellation” quadrature rules for MFE integrals can improve 
accuracy on highly distorted grids and can make the admissible set of grids 
in 2-D and 3-D much wider. 

Direct simulation of Forchheimer flows is very expensive with high order 
approximation schemes being desirable. 
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Mathematical Treatment of Diffusion 
Processes of Gases and Fluids in Porous Media 

Norbert Herrmann 



Abstract 

The transport of fluids and gases in narrow pore systems is de- 
scribed by the transport equation and the material balance equation. 
In this paper we start with a typical example of such a process and 
develop the underlying parabolic partial differential equation and the 
corresponding initial and boundary conditions. Afterwards we describe 
how to reformulate the problem into a Fredholm integral equation of 
the first kind, which leads to the boundary element method. Whereas 
there exists an almost complete approach to the finite difference me- 
thod and the finite element method, comparably little is known for the 
BEM. We use the collocation method to solve the Fredholm integral 
equation of the first kind and present a convergence theorem. 

A computer program shows that the the predicted error is in good 
agreement with the calculated result. 



KEYWORDS: parabolic equation, boundary element method, collocation 
method 



1 The Physical Problem 

For a given gas A the general gas law is well-known: 
gas law p ■ V = n ■ k ■ T, 



( 1 . 1 ) 



where p is the pressure, V the volume, n the number of molecules, k the gas 
constant, and T the temperature. The concentration c is defined as 



n p 
^^V^kT' 



The transport of gases obeys the following equation: 

transport equation j = ■ grade. 



( 1 . 2 ) 



( 1 . 3 ) 



where we denote by j the flux of the gas and by the (effective) diffusion 
coefficient. 



Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 170-^^^ 2000. 
(c) Springer-Verlag Berlin Heidelberg 2000 
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A complete description of the gas transport has also to bear in mind the 
effect of adsorption a, because for an isothermic process we have 



a = /(c), 



(1.4) 



and this function / is normally a nonlinear one. If we would like to include 
/, we would get a nonlinear equation. Because we want to explain how 
the boundary element method could be applied, we omit the influence of 
adsorption (/ = 0) to have a linear equation. Later we will show where 
linearity is needed. 

The change of the concentration c means that there are sinks or sources 
in the region. This is mathematically described by the divergence divj of 
the ffux. So we get the equation 

Oc -* 

balance equation — = — divj. (1-5) 

Combining (1.3) and (1.5) leads to the following partial differential equation: 

-=div(i?^^/ grade), (1.6) 

which is a parabolic partial differential equation. 

At the start of the process we know the concentration from the experiment 
which gives us the initial condition 



f Ci for t = 0 in the interior, 

[ Ci? for t = 0 on the boundary. 



(1.7) 



As the first boundary condition we demand that no change of concentration 
take place in the centre of symmetry, say the origin: 



gradc(0,f) =0 in the centre of symmetry and for all t. (1-8) 

The second boundary condition originates from a balance equation 

dc 

. grad c = —C ■ — on the boundary for all t, (1-9) 

where C is a constant with respect to time and space. 

The above consideration can be summarized in the following system of 
equations: 

If =div(D®/%adc), 

{ Ci, for t = 0 in the interior 

cr, for t = 0 on the boundary, 

grad c(0, t) = 0 in the centre of symmetry and for all t. 

£)®// . grad c = —C ■ on the boundary for all t. 




172 



Herrmann, Norbert 



Two essential difficulties appear in this system: 

1. The initial condition is not continuous. 

2. The boundary conditions and are both of Neumann type. 



2 The Boundary Integral Equation 



Let us consider the following model problem: 



u — Au 
u{x, t) 
u{x, 0) 



/ in n X /, 

g{x,t) on r X /, 

0 for a; G ft. 



In the finite element method (FEM) we transform this model problem into a 
weak form by multiplying both sides with a test function v and integrating 
over the whole domain 12. Here we do exactly the same. But since our 
problem is time dependent, we need a further integration over the time and 
so come to the equation 



n {ii{x, t) — Au{x, t)) ■ v{x, to — t) dx dt 
! 




f{x,t)v{x,to 



t) dx dt. 



In the next step we use the above Green’s formula instead of Green’s formula 
in the FEM: 



fofo ~ Au{x, t))v{x, to — t) — u{x, to — t){v{x, t) — Av{x, t))] dxdt 

= It fr • v(x, to - t)l dj dt. 



The next step consists in applying an appropriate test function. In the FEM 
we use a trial function from a certain Sobolev space, which fulfils the bo- 
undary conditions. Here we choose the Green’s function for the (linear) 
differential operator ii — Au, and this is the reason why we are so keen to 
have a linear problem. Otherwise, we have a trouble to find a Green function. 

The Green function for the operator m — A is well-known: 

{ 1 

— « t > 0,x gM 

2V^t 

0 2 < 0, X G IR 



This function is now used as the test function 



v{xo,to - t) := G{x. 



X, to — t). 
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The most important property of Green’s function is the following identity: 



u 



{Xqj to) 



U 



{x,t — to){G — AG) dxdt. 



lo Jn 

With this formula, we get the following representation formula: 
u{xo,to)= /g° /q u{x, t — to){G — AG) dxdt 



= lo /i 



du(x,t) 

dn 



G (Xo 






dj dt 



+ /q “ Jq f{x, t) ■ G(xo - X, to- t) dx dt. 



In the theory of integral equations, parts of the right-hand side are well- 
known, so 



Ko[u]{xo,to) ■■= J J - X, to - t) dj dt (2.1) 

is called the single layer potential and 

Ki[u]{xo,to) ■■= J J u{x,t) ■ — ^J.d'jdt (2.2) 

is called the double layer potential. Indeed, if we consider the potential of 
a single plate the GauB law gives exactly (2.1) and the same is true for the 
potential of a double layer (2.2). 

We summarize a few theoretical results, which can be found in Q . 



Theorem 2.1 The single layer potential 

and the double layer potential 

Ki : H^’i{T) - A) 



are both continuous operators. 



Theorem 2.2 (Plemelj-Sochozky) Let 'll} G H~i’~i{T),w G Hi’i{T), 
and 7 describe the restriction operator to the boundary T. We abbreviate by 
[yrt] the difference of the boundary values on T, when coming from outside 
12° and from inside T* .■ 



[ju] := 7(u|n°) - 7(?^la0- 
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Then we have the following jump relations: 

I'yKo'if] = 0, [jKiw] = w. 

Note that the above equation means that the single layer potential does 
not jump, but in the double layer potential we have a jump when going from 
outside and from inside to the boundary. 

3 Neumann Problem of a Diffusion Process 

We now apply these theoretical considerations to our model problem. Let 
12 be a bounded convex domain in and let its boundary T = 312 be 
sufficiently smooth. Then we consider the following Neumann boundary value 
problem: 

= D-ff -Ac{x,t), (a:, 2) G 12 X (0, T] , 

c(x, 0) = Co(a:), X G 12, (3-1) 

= f{x, t), (x, 2) G r X (0, T]. 

We know from 0 that if T = 312 G f continuous, and Cq G in an 
enviroment of T, then there exists a unique solution. The Green function is 

With this function, we get our representation formula 

c{xo,to)= -fofr - Xo,t- to) ■ c{x,t) dt 

So Sr 2 to) 4^c(x, 2 ) d^j, dt 

+ Sn G{x — Xo,t) ■ Co{x) dx, Xo GTI, Q <to <T. 

For Xo — t T, we have to observe the jump relation and come to the following 
equation which is now valid for Xo G F: 

\c{xo, to)= -p-v. /p /p J^G(x - Xo,t- to) ■ c(x, 2) djj; dt 

+ /o /p G(x - Xo,t- to) §^c{x, 2) d'^x dt 

+ G(x — Xo, t) ■ Co(x) dx, Xo GT,0 <to <T. 

Now, including the boundary conditions, we are led to an equation which 
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could be called the mathematical model of our problem: 

ic(xo, to)+p.v. f* /p £^G(x -Xo,t- to) ■ c(x, t) djx dt 
= f* /p G(x - Xo,t- to) ■ fix, t) d')x dt 
+ G(x — Xo, t) ■ Co(x) dx, Xo & T, 0 < to < T. 

4 The Collocation Method 

As a model problem, we consider a ball where our diffusion process should 
take place. Because of the symmetry we can restrict our attention to a two 
dimensional ball, i.e., a circle, so that the boundary T is the circle line shown 
below: 




We divide this line into N parts by fixing the points X\,X 2 , ■ ■ ■ , xn+i = xi. 
We call 

Bj . arc ixj,Xj^i), . Xj,Xj^i. 

It is not necessary that all the parts are of equal size, but they should be 
quasiuniform, i.e., we demand: 

max |Bj|/min |Bj| < const. 

j 3 

We introduce the abbreviations 

h := max |B' |, k := T/M, t„ = p - k, p = 1, . . . , M, 

3 
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where M is the number of time steps. As the first approximation we consider 
the polygon F', formed by Fi, . . . , Fat, instead of the circle line F, and solve 
the boundary integral on F'. In the same way we have to use the inner region 
n' of F' instead of fl. Then we use the ansatz 

M N 
p=i 

where consists of the following functions: 

x' G F', t G (0,T] 

with ifj being the linear spline basis functions on F', j = 1, . . . , \p the 
constant spline basis functions on [0,T], p = 1, . . . ,M. We define the finite 
dimensional ansatz space 

:= span p. 

Now, our question consists in finding 

{c’j)l<j<N,l<p<M- 

such that 

Ic^j+p.v. fg /p, -^G{x' -Xj,t- tp) ■ c{x', t) d-ycc' dt 
= fo fr' tp) ■ iPhf){x', t) dt 

+ G{x' — Xj,tp) ■ Co(x') dx'. 

This equation looks complicated and indeed it is complicated. It is a simple 
system oin-M linear equations with the N-M unknowns c^, but the entries of 
the system matrix are singular integrals where we have to use their principal 
values. On the other hand, this procedure is not as tedious as the application 
of the finite element method for solving the boundary integral equation. The 
FEM leads to double integrals over the boundary combined with a time 
integration. 

5 Convergence 

Important not only for the theoretical background but also for the nume- 
rical approximation is the following result, which states the stability of the 
collocation method: 
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Theorem 5.1 There exists £o > 0 such that, for all grids onT x [0,T], with 

So > s := Vk/h > 0, (5.1) 

the coefficient matrix Kn is diagonally dominant and we have 

ll^i^'lloo <C, 

where the constant C > 0 is independent of h, k, and N. 



The restriction (5.1) arises typically in the finite difference method. It 
means that we can not refine the space step h without refining the time step. 
In the finite difference method this is needed to ensure the stability, and here 
we also see that we get stability in that the inverse of the coefficient matrix 
is uniformly bounded independent of N. 

From the above theorem we conclude the convergence result, which gives 
linear convergence of the collocation method, linear in space and time. 



Theorem 5.2 Let u be the exact solution o/(3.1), Uh ■= PhU (the projection 
of u onto and e{x' ,f) := Uh{x' ,i) — u{x,f), x' G F', t G I. There 

exists a positive constant C > 0 such that 

||e||oo := max \e{x' ,t)\ < C ■ {h + k). 

x'er',tei 

6 Numerical Results 

With the help of a computer program we solve the following problem: Given a 
ball of radius one and an oval shown below, we choose w = 1 to be the solution 
and adapt the boundary and initial conditions, respectively. Therefore, we 
are able to compare the exact solution with the approximate solution and it 
is possible to calculate the L°°-norm for their difference. The result is shown 
in the figure below. It can be seen that we have indeed a good agreement 
with the theoretical result in Theorem 5.2. 
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Implementation of a Locally Conservative 
Eulerian-Lagrangian Method Applied to 
Nuclear Contaminant Transport 

Chieh-Sen Huang Anna M. Spagnuolo 



Abstract 

Recently, in the study of computational geosciences, a new Locally 
Conservative Euler-Lagrangian Method (LCELM) P] was introduced 
by Douglas, Pereira, and Yeh. They have shown superior results to 
those using the Modified Method of Characteristics (MMOC) and the 
Modified Method of Characteristics with Adjust Advection (MMO- 
CAA) for the problem of two-phase, immiscible, incompressible flow in 
porous media. The object of this paper is to implement the LCELM 
applied to the transport of a high-level nuclear decay chain for the pur- 
pose of locally conserving the mass of each element in the chain. This 
method is coupled with mixed hnite elements for the spatial discre- 
tization of each concentration equation and for the pressure equation. 
Computational results comparing the LCELM, the MMOC, and the 
MMOCAA are presented. 

KEYWORDS: LCELM, MMOC, MMOCAA 

1 Introduction 

This work is an extension of the work done in ^ , where the modified method 
of characteristics (MMOC) and a modified method of characteristics with 
adjusted advection (MMOCAA) were applied to approximate a finite number 
of concentrations corresponding to elements in a nuclear decay chain. See 
0 for the development and proof of convergence of the MMOCAA in this 
miscible displacement problem. 

High-level nuclear waste is contained in engineering barriers and then it 
is buried. For the purpose of assessing nuclear waste repository sites, we 
are interested in tracking the contaminants that might escape containment 
and enter saturated groundwater flows. In particular, an important chain, 
namely 

234u 230j.f^ 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 179-^^^ 2000. 
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is considered in 00. 

We let n denote our domain which represents the reservoir. Let d{x), 
a: £ n be the local elevation. Since we are considering contaminants that 
have half-lives on the orders of tens of thousands of years, a change in tem- 
perature will not affect any of the decay factors. Therefore, we assume that 
the phenomena occur at a constant temperature. Furthermore, we assume 
that we are dealing with very low concentrations of each contaminant. This 
implies that the viscosity fi and the density p of the fluid are independent 
of the concentrations. Let be the number of contaminants in the decay 
chain, and let c^, a = l,...,7Vc be the element. Furthermore, denote 
by 9 = q{x, t) a macroscopically distributed source term; q is positive at in- 
jection points and negative at production points. Let J = (0,T] be the time 
interval of interest. 

Recently, a fast, accurate, and stable numerical method 0, the locally 
conservative Eulerian-Lagrangian method (LCELM) for transport-dominated 
diffusive systems was derived. The LCELM retains the computational effi- 
ciency of the MMOC and its variant, the MMOCAA. Moreover, the LCELM 
preserves the desired conservation principles locally, while the MMOCAA 
conserves them globally, and the MMOC does not at all. 

It should be noted that the restriction of the LCELM to miscible flow in 
porous media is essentially the same as the characteristics-mixed method in 
0. The fundamental difference between these new methods and the MMOC 
and the MMOCAA is that the new methods consider the partial differential 
system in divergence form and then split the transport from the diffusion, 
rather than relate to the nondivergence form and make use of the charac- 
teristics associated with the first-order transport part of the system in a 
fractional step procedure that splits the transport from the diffusive part of 
the system. It is the use of the divergence form that allows the localization of 
the transport so that the desired conservation property can also be localized. 

This paper closely follows the computational ideas in 0. There, immis- 
cible displacement is considered; here, the flow is miscible and there is a 
finite number of contaminants to trace, giving a vector concentration. The 
LCELM applied to high-level nuclear contaminant transport is an alteration 
of the original MMOC and MMOCAA for miscible flow 0 that achieves local 
mass conservation of each component in the system. A set of comparisons 
showing the MMOC, the MMOCAA, and the LCELM are given for the first 
element in the decay chain. 



LCELM Applied to Nuclear Contaminant Transport 



181 



The organization of the paper is as follows. The flow system is described 
in §2. The finite element spaces to be used for the spatial discretization are 
introduced in §3. In §4, the problem is discretized in time and the details of 
the LCELM procedure are given; in §4.1, the LCELM will be described in 
the differential setting, followed by the LCELM transport step in §4.2, the 
diffusive step in §4.3, and the pressure calculation in §4.4. Numerical results 
are given in §5; comparisons of the MMOCAA, MMOC, and LCELM will be 
presented. 



2 The Flow System 



For OL = l,...,A^c) the following equations govern the flow of a high-level 
nuclear decay chain through porous media: 



k 

u = —{ — {Vp + pgVd)), X € fl, t G J, 
V • u = q, X G Tt, t G J, 



(2.1a) 



' t,X ' 



OlCq, 



CaU 



- V • (Da{u)VCa) 



— AcTckCc “t“ Aa— 1 Cq,— 1 Cq,_ 1 -t“ Co^q^ 

CaU ■ nn = {Da{u)Vca) • no = 0, 



X gQ., t G J, 

X G do., t G J, 



(2.1&) 



c„(a;, 0) = CQ,,o(a;), xGTI, 

where Aq = tq = cq = 0 and nn is the outward unit normal to fl. In (2.1a), 
p = p{x,t) is the pressure, g is the gravitational constant, k = k{x) is the 
permeability of the medium, and </> = <)'(x) is the porosity of the medium. 
Then, with u = u{x, t) denoting the volumetric flow rate, the first equation 
represents Darcy’s law, and the second reflects the incompressibility of the 
fluid. For a= 1, . . . , A^c, equations (2.1b) are the mass-balance equations for 
the concentration Cq- of the contaminant in a chain that consists of Nc 
elements starting with c\. Note that (2.1b) is in divergence form. Let Aq, 
for a = 1, . . . , AI 3 be the decay constant of the element. Also, since each 
species has the potential to adhere to the rock during the flow process, we 
denote by ra the retardation factor of the contaminant for a = 1 , . . . , Nc- 
For simplicity, the boundary conditions reflect the assumption that the pe- 
riphery of the reservoir is impermeable. Compatibility to incompressibility 
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requires that 



qdx = 0. 



( 2 . 2 ) 



Following j2j for each contaminant, we assume = Da{(j),u), for a = 
1, . . . , Nc, is the dim(n) xdim(n) matrix, 



Ha — 4^ [da, mol \u\(^da,longH(^U) + da,transH (zi))], 



representing diffusion. In (2.3), da,moi is the molecular diffusion coefficient, 
and da, long and da,trans are, respectively, the longitudinal and transverse di- 
spersion coefficients for the contaminant. The matrix E is the projection 
along the direction of flow given by 

E{u) = \u\~‘^[uiUj], (2.4) 

and E^ = I — E. Usually, da, long is considerably larger than da,trans- The 
term Cq is the specified concentration of the contaminant whenever q is 
positive, and it is the unknown concentration Ca when q is negative. Finally, 
the last equation in (2.1b) specifies the initial condition. 



3 Spatial Discretization 

by Mixed Finite Elements 



For simplicity, we do not take gravity into account from this point. Below, 
we rewrite the flow equations in terms of flux vectors, because we use mixed 
formulations for our space and time discretizations: 



V • u = g. 



u = Vp, 

d 



(3.1) 



/ </>r„Ca \ ^ ^ ^ ^ ^ 

^ t,x * I I H“ V * V(y — ^0:7 '^a — v^-^) 

V CaU J 

where ga = —^af'aCa + ^a-if'a-iCa-i + {ca — Ca)q'^- Our numerical procedure 
is developed starting from equations (3.1) and (3.2). 

Let 

n = [0,LX] X [0,LY], 



and set H = {HX,HY}, where HX = LXfNX and HY = LYjNY. Then, 
let Xi = iHX and Yj = jHY, and define the elements of the partition T = 
T{H) = {M„- : i = l,...,NX,j = 1,...,NY} by M„- = [X,_i,X,] x 
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[Yj-i,Yj]; T will serve for both the pressure and the concentration equations. 
Let 

V = V{H) = {u G H{div \Mij & 'Pi,o x ’^ 0,1 and v ■ n = 0 on dfl} , 

W = W(H) = {w : w |m.,£ ^o} C 

where Pk denotes the set of polynomials of total degree k and Pk,( denotes 
the tensor product of polynomials of degree A: in cc by those of degree £ in y. 
Then, set 



i.e., the lowest index Raviart-Thomas mixed finite element space over the 
partition T. 

We shall seek an approximate solution to the system (3.1), (3.2). 

4 Discretization in Time 

Following w, we employ a time-discretization procedure based on operator 
splitting ideas. To do this, let 



where a\ and 02 are positive integers. Let t"* = mAtp and denote by /"* 
a function / evaluated at time t™. Similarly, let = nAtd, fn = f{tn)i 

fn,n ~ f{£n,K)t where t^ i^ — An T I^Attj-j and fm,n ~ f where 

' A ' -I- nAtd- 

The pressure is approximated at times t™, to = 0, 1, ...,. For each concen- 
tration equation, the LCELM procedure is applied. It splits into transport 
microsteps and diffusive fractional steps, in which the concentration is com- 
puted at times t„,K and tn, respectively, n = 1, 2, ..., k = 1, ..., 02 . A detailed 
algorithm is given in the following subsections. 

4.1 The Differential LCELM Procedure 

Recall that the concentration equations can be written in the divergence form 
as 



M = M{H) = V X VF; 



Atp — a\Atd-j 



Atd — a,2Atij-., 



(4.1) 




(4.2) 



where Pa = —XaTaCa + Xa-lTa-lCa-l + CaQ for O = 1, . . . , N^- 
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Then, the fractional stepping procedure for it corresponds to the transport 
equation 



= 9o 



CaU 



followed by the diffusive part given by 

dco 



+ divua = 0. 
at 



(4.3) 



(4.4) 



Following the notation in [5|, consider the space-time slice Q = x 
[in.K, tn,K+l]- Let /C be a reasonably shaped, simply-connected subset of fl, 
and define a subset V — T>„_„(/C) of Q as follows. For each x S dIC, construct 
the solution j/(x; t) of the final value problem 

dy u , , , , 

yw ~ yr 7 ^n,K+l ^ ^ ^ ^n,K+l) ~ ^7 (^-b) 

at (pra 

and set 

Xn,K{x) = y{x;t„^f,). (4.6) 

Then, let /C = /C„,k be the interior of the set {xn,K{x) : x G dIC} and 
T> the tube determined by /C, JC, and the integral curves (4.5). (For Attr 
sufficiently small, the map x — >■ Xn,K is one-to-one, so that this construction 
can be carried out.) Now, denote the outward normal to dV by a{x,t), note 
that it is orthogonal to the vector {(praCa,CauY on the lateral surface of V, 
and integrate (4.3) over V: 



' t,X ‘ 



] dxdt= [ ( ) -adA 

JT> \ Call J J dT> V CaU J 

= L4>XaCa{tn^K+l,x)dx - Jj^4>raCa{tn,K,x)dx ( 4 - 7 ) 



= /.p ga dx dt. 



Thus, mass is conserved locally in the transport step, as defined in (4.3) 
above, if 

/ (praCa{x,tn,i^+i) dx = _(j)raCa{x , tn,ii) dx + / gadxdt. (4.8) 
JK Jk Jv 



4.2 The LCELM Transport Microstep 

Computation results show that letting = Mij is a reasonable choice. 
Let’s define "fia,ij as the quadrilateral obtained by approximating the integral 



curves (4.5) at the vertices Xij^k, A: = 1, . . . , 4, of K^-. Let 

Tm,n,K-\-la, 

J 



'<k,n,K — 



(firo 



-AU 



(4.9) 
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where is the interpolation of the known velocity at times tm and 

tm+i at time tm,n,K,+i defined by 

l'(K+l)Attr\ m+l 

M, ) “• + [ At, ) ■ 

Note that there are copies of the predecessor set of K^-, one for each a. 
Since the microstep Attr is usually small with respect to Atd, in (4.9) we 
follow the tangent to the integral curve through k+i) back to the 

time level Note that the trace back is in exactly the same direction as 
the characteristic; however, in the LCELM it associated with the vertices of 
K, rather than in the interior of K as in the MMOC and MMOCAA. 



r77l,n,K+l 



= 1 - 




Figure 1: The heterogeneous permeability field (in millidarcies) 
with a coefficient of variation of 2.86 on a 64 x 64 grid . 



Next, we must discretize the local conservation relation (4.8). Using the 
trapezoidal rule, we have 




dx — 




(f)raZ 



<A,n,K 



dx+ 



Attr 

2 




9a dx + 
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where Za^n,K is used to denote the solutions of concentration in the transport 
step. This is an explicit calculation and only needs to be performed when 
Pa does not vanish. Moreover, unlike the immiscible problem, it is a linear 
equation so the solutions can be solved directly. 





Figure 2: The pictures are results of a mesh refinement study. 



4.3 The LCELM Diffusive Fractional Step 
for the Concentrations 

After performing the transport microsteps, we have the functions 
Ca,n = ^OL,n,a2-l (C+l) G W 



which will be the initial conditions (one for each a = 1, . . . ,Nc) at time 
for the diffusive steps. 
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We shall apply the mixed finite element method to the equations 

Va,n+1 = -Da{u){VCa), + divVa,„+i = 0, (4.10) 

^td 

subject to the boundary conditions 

Va,n+1 ■n = 0. (4-11) 



Thus the mixed finite element equations take the form 



( “ (Ca.n+I, div u) = 0, V eV, 

\Da[u) ^ ) 

+ ( div Wa,„+ 1 , w) = 0, w GW. 

4.4 The LCELM Pressure Calculation 



(4.12) 



The equations for {■u’”,p’”} are given by (see (3.1)) 

V-u'^ = q, = (4.13) 

subject to the boundary condition u™ • n = 0 on 9f2. The corresponding 
mixed finite element equations are 



— (p'", divu) =0, V GV, 
(divu™, w) = (g, w), w G W. 



(4.14) 



5 Implementation 

The algorithm described in this paper is applied to the decay chain —>.230 
Th —>226 preliminary study, only limited numerical results are 

presented here; more intensive numerical simulations will be carried out else- 
where. A heterogeneous reservoir (see Figure 1) with a coefficient of variation 
of 2.86 is used. The reservoir is initially saturated with water. The leakage 
(or injection) of uranium from its container into the reservoir occurs at the 
left-hand side of the reservoir (slab geometry). The injection is at a constant 
rate of 500 years per pore volume with a concentration of uranium of le-5. 
Only the results of uranium are presented. The values in Table 1 are fixed 
in all of the computations. 

The relative mass-balance errors using the MMOC in Fig. 2 are 11.9% and 
10.6%, for the grids from left to right, respectively. Therefore, the MMOC is 
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Table 1: Parameter Values 



Viscosity 

Porosity 

Decay Constant 

Retention Factor(^^^C) 

Dispersion(djo„g) 

Dispersion(d(rons) 

Dispersion(dmoi) 



/i(poise/((g/cc)) = 0.01 
^ = 0.01 

Ai = 9 X 10“^^(l/sec) 

n = 12 

diong = 1 X 10“^(cm^/sec) 
dtrans = 1 X 10“"^ (cm^ /sec) 
dmol = 1 X 10“^(cm^/sec) 



more accurate when the grid is finer, which is what we expect. However, the 
MMOCAA and LCELM conserve mass exactly, regardless of grid size. Notice 
that in the MMOCAA and LCELM figures, the fingers are more defined and 
reach the well faster than those in the MMOC. However, the fingers in the 
LCELM results are not more defined than those in the MMOCAA results, 
but they do reach the well faster. 

The pictures in Figure 2 are results of a mesh refinement study of uranium 
in a heterogeneous reservoir with a coefficient of variation of 2.86 after 5100 
years of simulation using the MMOC (top row), the MMOCAA (middle row) 
and the LCELM (bottom row). The grids have elements of sizes 64 x 64 
(left-hand side pictures) 128 x 128 (right-hand side pictures). 
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Application of a Class of Nonstationary 
Iterative Methods to Flow Problems 

XiuREN Lei Hong Peng 



Abstract 

Convergence of a certain class of nonstationary iterative methods 
applied to the numerical solution of algebraic linear systems arising in 
flow problems is studied. The iteration matrix of these methods can 
be expressed by a constant matrix plus a variable matrix tending to 
zero. The conclusions of convergence based on the matrix spectrum 
are given and applied to a class of semi-iterative methods. 



KEYWORDS: algebraic linear system, iterative method, convergence, matrix 
spectrum 



1 Introduction 

Iterative methods for obtaining the numerical solution of algebraic linear 
systems can be classified into two broad groups: linear stationary methods 
and linear nonstationary methods. For the stationary case, there is a well- 
known sufficient and necessary condition for convergence; i.e., the spectral 
radius of the associated iteration matrix is less than one. On the other hand, 
the problem of convergence of the nonstationary methods is more complex. 
For this case, the result based on the spectrum radius analysis is not generally 
applicable. 

However, as we prove in this paper, for a certain class of nonstationary 
iterative methods whose iteration matrix can be expressed by a constant 
matrix plus a variable matrix tending to zero, it is possible to establish a 
sufficient condition for convergence based on the spectral radius. Also, we 
apply the result to a class of semi-iterative methods [1], e.g., the Chebyshev 
semi-iterative methods, and obtain the relevant conditions for convergence of 
these methods. The result in [2] is a particular case of this paper. 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 190-^^^ 2000. 

(c) Springer-Verlag Berlin Heidelberg 2000 



Application of a Class of Nonstationary Iterative Methods 



191 



2 Convergence of Nonstationary Methods 

We consider the system of linear equations 

Ax = b, ( 2 - 1 ) 

where A is an TV x V nonsingular matrix and 6 is a vector of length N . Let a 
class of nonstationary iterative methods consistent with (2.1) be written as 

+hn, n> 1 , ( 2 . 2 ) 

where the condition is satisfied 



Gn — G + Un, 

G is a constant matrix independent of n, and 

C/„ — >■ 0, n — >■ oo. 

By (2.2), we can write 

+iV„, n > 1, 

where 

Tn = G„G„_i • • • Gi, 

Kn = hn + Gnhn-l + G„G„_i/l „-2 + ’ ’ ’ + G„G „-1 ' ’ ' G2T11. 



(2.3) 



Theorem 2.1 (Theorem 3.1 in [3]) The method (2.2) converges if and only 

lim Tn = 0. 

n-s-oo 

Let S'(T) denote the spectral radius of a matrix A; i.e., 

S'(xl) = max |A|, 

where A is an eigenvalue of A. 

We need the following three lemmas from [4, pp. 141-146]. 

Lemma 2.1 Given a matrix A of order N and e > 0, there exists a matrix 
M such that 



S{A) < IlMAM-ijloo < 5(^)+e. 
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Lemma 2.2 Let A be an N x N matrix and e > 0. There exist two positive 
constants p,i> Q and a > Q, depending on A and e, such that if for a sequence 
of N X N matrices Lfk with 

I|^fc||c30^/^l5 h 1 , 2 ,..., 

where 

/xi = e/(2cr), a = \\M\\ao\\M~'^\\ao, 

then we have 

m 

1[{A+Uk) <a{S{A) + er, m=l,2,.... 

k—1 QQ 

Lemma 2.3 Let A be an N x N matrix such that S{A) > 1 and e > 0 such 
that S{A) — e > 1. If for a sequence of N x N matrices Uk there exists 
6 = 6{A, e) > 0 such that 

\\Uk\\i<S, fc = l,2,..., 

then the sequence of matrices 

m 

J]:= J](^+[/,) 

m k=l 

is not convergent when m — >■ oo. 

Theorem 2.2 Let G be given by (2.3). Then, (a) the iterative method (2.2) 
is convergent if S{G) < 1; (b) it is not convergent if S{G) > 1. 

Proof: (a) The desired result can be obtained by Lemmas 2.1 and 2.2 and 
Theorem 2.1. (b) By Lemma 2.3 and Theorem 2.1, we obtain the desired 
result. [] 

3 Convergence of Semi-iterative Methods 

Now, we apply Theorem 2.2 to a certain class of semi-iterative methods for 
obtaining the solution of (2.1). Let the iterative method for (2.1) be written 
as 

x„+i = Gxn + k, n > 0. (3.1) 

By [5], we get the acceleration polynomial 
(5o(a;) = 1, Qi(x) = i^ix - + 1, 

(^) — Pn-t-1 (^n+1^ “t“ 1 ^n+1 )Qn (^) (f Pn-i-l'jQn— 1 (^) j ^ ^ 1 j 
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where pi and Vi are real numbers. The corresponding semi-iterative me- 
thods can be obtained by the three-term recurrence relation 

= Vi{Gu^^^ -I- fc) -I- (1 — 

«("+!) = -I- A:) -I- (1 - -I- (1 - Pn+l)u^'^~^\ 

(3.2) 

for n > 1. We can rewrite (3.2) as 

= Pn |KG+ (1 - + Vnk^ + (1 - /0„)U^”“^^. 



Then we have 



u" 



i.e., 



where 



0 



,n-2 



(1 - Pn)I Pn[VnG + (1 ~ Vn)I] J \u^~^ 
l/(") = G* -h 



■ Pn ^n\ T i 1 

K ) 



G* = 



0 _ / 

(l-p„)/ Pn[l^nG + {I - n„)I] J ^ 

K=Pn^n{l), V^-)={-::p. 



(3.3) 



Theorem 3.1 Let lim„_>oo = p, lim„_>oo v^ = v, G he given by (3.1), and 
^i=((l-p)I p[j,G+{l-n)I])- 

Then, (a) the method (3.2) is convergent if S{G\) < 1; (b) it is not convergent 

ifS{GG>i. 



Proof: Set 



Hn = Pn[VnG+ (1 - = p[vG + (I - v)I], 

Then (3.3) can be written as 



G1 = Gi 



Un 



where 



Un 



( 

\{P- Pn)I 



0 ) 



lim Un = 0, 

n—^oo 



Since 
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by Theorem 2.2, we complete the proof. [] 

The Chebyshev semi-iterative methods, discussed in [2], is a particular 
case of the semi-iterative methods mentioned here. The results in [2] can be 
obtained from Theorem 3.1. 
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Abstract 

The rapid development of parallel computers has provided a hard- 
ware background for massive refine reservoir simulation. However, the 
lack of parallel reservoir simulation software has blocked the applica- 
tion of parallel computers on reservoir simulation. Although a variety 
of parallel methods have been studied and applied to black oil, com- 
positional, and chemical model numerical simulations, there has been 
limited parallel software available for reservoir simulation. Especially, 
the parallelization study of reservoir thermal recovery simulation has 
not been fully carried out, because of the complexity of its models and 
algorithms. The authors make use of the message passing interface 
(MPI) standard communication library, the domain decomposition me- 
thod, the block Jacobi iteration algorithm, and the dynamic memory 
allocation technique to parallelize their serial thermal recovery simu- 
lation software NUMSIP, which is being used in petroleum industry 
in China. The parallel software PNUMSIP was tested on both IBM 
SP2 and Dawn lOOOA distributed-memory parallel computers. The ex- 
periment results show that the parallelization of I/O has great effects 
on the efficiency of parallel software PNUMSIP; the data communica- 
tion bandwidth is also an important factor, which has an influence on 
software efficiency. 



KEYWORDS: domain decomposition method, block Jacobi iteration algo- 
rithm, reservoir thermal recovery simulation, distributed-memory parallel 
computer 

1 Introduction 

The rapid development of parallel computers provides a solid hardware foun- 
dation for developing parallel software used in industries such as oil reservoir 
recovery and airplane manufacture. There are mainly three kinds of parallel 
computers, i.e., shared-memory, distributed-memory, and parallel computers 
of the mixed type. The typical vector computers are Cray series, such as 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 195-^^^ 2000. 
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Cray90. The well-known IBM SP2 and Dawn lOOOA are distributed-memory 
computers. As for shared-memory parallel computers, the scales of compu- 
tation models are limited by the bandwidth of data bus and CPU number. 
But for a distributed-memory one, it has good extensibility and does not li- 
mit the computation scale. However, accessing to the memory of a neighbor 
CPU has to be realized by programmers. It increases the complexity of pro- 
grams. It is noticeable that a mixed- type of computers are developed such 
as 0ringin2000 of SGI Company. This kind of computers greatly decreases 
the complexity of programs. 

In the past 20 years, the study of parallelization of serial software of oil 
reservoir recovery simulation has been carried out on all these kinds of par- 
allel computers [1, 16, 15, 6, 9]. It focuses on parallelization strategy, linear 
equation solution and parallel software application [12, 9, 13, 10]. Wallis , et 
al. [17] developed a nested algorithm to solve linear equations and obtained 
high speedup in distribution calculation environment. Killough [7] applied 
the domain decomposition method (DDM) based on local grid refinement 
to the black oil and compositional models. Briens, et al. [1] studied the 
stability of parallel software of the compositional model. Mejerink [11] paral- 
lelized the black oil numerical simulation software Bosium by using the block 
Jacobi iteration method. Rame [16], et al. parallelized the chemical drive 
software UTCHEM by using the vector parallel technology. It can be seen 
that parallelization of software of the black oil, compositional, and chemical 
models has been deeply studied. In particular, the parallelization algorithms 
of linear equation solution have been applied to the calculation of the black 
oil model numerical simulation. However, there has been no literature about 
parallelization of the thermal recovery simulation. 

Most of serial software are parallelized mainly in the part of linear equa- 
tion setup and solution. For most of reservoir simulation software, the esta- 
blishment of coefficient matrices and solution of linear equations, respectively, 
takes 30-40% and 50-60% calculation time; the PVT and rock physical pro- 
perty parameter calculation and the initialization of data fields take less than 
10% calculation time. Therefore, parallelization of linear equation solution 
is the key to obtain high parallel efficiency. Under a shared-memory compu- 
tation environment, a high parallel efficiency can be obtained in such mode. 
However, for a distributed-memory computer, the memory of a node is li- 
mited and it may be impossible to load data without parallelization of the 
data load. Besides, data output is also an important factor, which leads to 




Reservoir Thermal Recover Simulation 



197 



communication between nodes. Especially, for large-scale reservoir thermal 
recovery simulation, such as history match and whole-scale reservoir simula- 
tion, the frequency of I/O operation is high and the scale of data stream is 
large, because of frequent operation of large number of injection and produc- 
tion wells. In general, data I/O is a bottom neck of reservoir simulation. In a 
distributed-memory computation environment, it is necessary to parallel the 
I/O part of simulation software. 

To meet the increasing requirement for the model scale of refine reservoir 
simulation, the authors parallelize the thermal recovery simulation software 
NUMSIP by using DDM and test it on distributed-memory parallel compu- 
ters. A parallelization strategy is designed to parallelize not only setup and 
solution of linear equations, but also I/O, PVT and rock property parameter 
calculation and data initialization to improve efficiency of programs. If the 
boundaries between sub-domains are along natural faults, there is no commu- 
nication between such sub-domains. In such cases, the authors apply an irre- 
gular non-overlapping DDM to reduce communication between sub-domains. 
The linear equation solution algorithm is the block Jacobi iteration method. 
To guarantee the compatibility of software, the standard message interface 
MPI is used to realize the data communication. PNUMSIP is tested on both 
IBM SP2 and Dawn lOOOA. 

This paper consists of five parts. The model and algorithm of the serial 
software NUMSIP is introduced in §2. In §3, we give the parallelization 
strategy, data structure, and data communication mode of PNUMSIP. The 
test results are shown and analyzed in §4. We conclude with a few remarks 
in §5. 



2 Software NUMSIP 

NUMSIP is a simulation software of reservoir thermal recovery. It has been 
widely used in many oil fields in China, such as Liao He Oil Field, Sheng Li 
Oil Field, and Xing Jiang Oil Field. It can be used for designing reservoir 
recovery schemes of old and new oil fields, the history match, the production 
prediction, and the study of residual oil distribution. 

The following physics factors and processes are presented in the model of 
NUMSIP: 

a) viscosity, gravity and capillary force, 

b) heat conduction and convection processes. 
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c) heat losses of over burden and under burden of reservoir, 

d) effect of temperature on physical property parameters of oil, gas and water, 

e) rock compression and expansion. 

But kinetic, heat transfer, mass transfer, and the thermal cracking of hydro- 
carbons caused by molecule diffusion and dispersion are not considered. 

The governing equations include the mass conservation equation, the 
energy conservation equation, and the mole fraction and saturation constraint 
equations. 

1) The mass conservation equation 

(/ / Iv + f fs(pjXijVj)ds) 

(z.l j 

= 0, Z = 1, . . . , Me, 

where (p is the rock porosity, pj the density of the material in phase j, Sj 
the saturation of the material in phase j, Vj the surface flow velocity of the 
material in phase j, \ij the mole fraction of composition i in phase j, qt the 
source item of composition z, and Np the number of phases. 

2) The energy conservation equation 

f f fvw{p PjSjUj + {1- p)prockCp{T - Ti)^ dz; 

(z.z) 

+ / Jsi^h + <Zc)ds -|- Qh + Qc = 0, 

where Uj is the inner energy of the material in phase j, Prock the rock density, 
Cp the rock heat capacity, Ti the temperature of component z, qh the enthalpy 
flow velocity, qc the heat flow velocity, Qc the heat source item, and Qh the 
enthalpy source item. 

3) The mole fraction constraint equation 

= ( 2 - 3 ) 



4) The saturation constraint equation 



i;s.,=r 



The finite difference method (FDM) is used to discrete the above partial 
differential equations. More advanced methods [4] will be used. The sorting 
methods, such as the natural, red-black, and D4 sorting can be selected by 
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users to establish linear equations. Both the Gauss method and ORTHOMIN 
method can be used to solve linear equations. 

The serial NUMSIP consists of 258 modules. Because it has a good pro- 
gram structure, it is convenient for organization of structure of its parallel 
version PNUMSIP. To spare space, NUMSIP stores the data helds in a com- 
pact array. It decreases readability of programs and data access efficiency 
and increases the difficulty to parallelize them. It is not suitable for paralle- 
lization under a shared-memory parallel calculation environment. 

3 Parallelization of NUMSIP 

The aims to parallelize NUMSIP are a) to increase the model scale, b) to 
obtain the speedup and efficiency of software, and c) to develop a transplan- 
table and stable parallel software of thermal recovery simulation. Toward 
that end, various factors are synthetically taken into account. 

3.1 Software Parallelization 

Because 60-80% percent of calculation time is used for establishment and 
solution of linear equations, the prevailing strategy is to parallelize only these 
parts. However, it results in a problem; e.g., the model scale is limited by 
the size of accessible memory of a CPU. This problem becomes prominent 
in a parallel computation environment consisting of a workstation cluster. 
To solve this problem, a global parallel scheme is designed to parallelize a 
whole calculation process in DDM. It will beneht for improving speedup and 
efficiency of a program. To decrease data communication and simplify data 
structure, a non-overlapping irregular DDM is adopted. It is particularly 
suitable for the case that boundaries between sub-domain are along a natural 
fault, which can decrease data communication. For every sub-domain, the 
matured serial algorithm is used. 

3.2 Domain Decomposition 

To realize the domain decomposition, a 2D array Id(i,j) is introduced to 
mark different sub-domains. Its dehnition is given as follows: 

= k for grid {i,j) belonging to sub-domain k. (3.1) 

The boundaries between sub-domains can be traced from Id{i,j). Only 
the boundaries, which are not along natural faults, can be determined as valid 
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boundaries. For a valid boundary, there will be communication between the 
two sub-domains divided by this boundary. 

3.3 Data Structure 

On the basis of the original compact data array, two buffer arrays Bi and 
Be are introduced, respectively, to store the inner and outer boundary of a 
sub-domain. In the iteration calculation process, Bi is updated by the data 
field generated by the calculation at different iteration steps. It is transferred 
to the outer boundary buffer array Be of the neighboring sub-domain; see in 
Fig. I. 




Figl data sttucture and communication 
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Fig. 2. Domain decomposition of test problem Ti. 



1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


14 


15 


16 



Fig. 3. Domain decomposition of test problem T 2 - 



3.4 Data Communication 

According to the time variant characteristic of data to be transferred, the 
communication data are divided into three types, i.e., static data and slow 
and fast transient data. The data describing the geometric model of reservoir 
and rock property parameters are static data. The base values of the data 
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field at a time step in the iteration calculation process are slow transient 
data, and the increment values of the data field at an iteration step of the 
Newton-Raphson iteration are fast transient data. The block communication 
mode is used to transfer static data and slow transient data. But the non- 
block communication mode is adopted for transferring fast transient data to 
reduce communication overhead and improve communication efficiency. 



3.5 Time Step Control 

To ensure that the well data of all production periods can be safely loaded and 
satisfy the Newton-Raphson linearization condition, the minimum time step 
is selected. The domain is divided by using DDM and every sub-domain has 
a predicted next time step at every time step. To synchronize the calculation 
processes at different sub-domains, the time step control is determined as 
follows: 




o w t Ixiut 
fajit 
■wH thfaul 



serial 2 4 8 



CPU nimiber 



Fig. 4 Calculation time of test problem Tj 



a) predict time step T^k+i at sub-domain i, i = 1,2, . . . , N, 

b) determine the (k + l)th synchronic time step by 

tk+i = niin{fi_fc_|_i, f2,fe-i-i) • ■ • ) tN,k+i}, 

c) determine the (k -I- l)th time step ti^k+i of sub-domain i, ti^k+i = tk+i- 

3.6 Linear Equation Solution Algorithm 

The block Jacobi iteration algorithm is adopted to solve linear equations. The 
linear equations are established in the FDM and Newton-Raphson method. 
They are presented as follows: 
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where Xi, X 2 , ■ ■ Xpj, respectively, are solution vectors of data fields of sub- 
domains Gi, G2, ■ ■ ■, Gn- To solve these linear equations, an initial solution 
vector is given as = [Xj', . . . ,X^]*. The fcth iteration value of solution 
vector can be obtained from 

i-l N 

AuX': = B,-Y,A,,X';- Y, A,,xY- ( 3 . 3 ) 

i=i i=i-i-i 




CPUiKarber 



Fig. 5 calculation time 0 f test problem T 2 without fault 



The modification work of NUMSIP focuses on I/O. More than ten modules 
are modified and 24 new modules are added to realize parallelization of I/O, 
tracing of boundaries, establishment of linear equations, well distributions, 
and data communication. 
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CPU number 

Fig.6 Comaration of efficiency of PNUMSIP tested 
on IBM sp2 and Dawn lOOOA 



4 Test and Calculation Results 

4.1 Hardware Environment 

The IBM SP2 used for testing PNUMZIP is a distributed-memory computer, 
which consists of IBM RISC system/6000 workstations, high performance 
switch board, control workstation, and SP ether network. Its CPU clock 
frequency is 66.7 MHz. The peak calculation speed is 226Mflops. The com- 
munication bandwidth of its high performance switch board is 40Mbps and 
the bandwidth of its SP ether network is 1.25Mbps. It has 28 calculation 
nodes. Four of them are wide nodes and the rests are narrow nodes. A 
wide node has 128M memory and a narrow one has 64M memory. Parallel 
calculation process can run on only narrow nodes. 

Dawn lOOOA used for testing PNUMSIP has a structure similar to that 
of IBM SP2. It has eight nodes used for calculation. The clock frequency of 
CPU chips is 200 MHz. A node has 256M memory and 4.3G hard disk. The 
communication bandwidth of its high performance switch board is 10Mbps. 

4.2 Test Results and Analysis 

The test problems are cases used for study on well group schemes. One well 
group consists of 8 wells. Because of the limit of memory size of the used 
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parallel computers, the data model size of test problems Ti and T 2 , respec- 
tively, consists of only 8 and 16 well groups. Their grid sizes, respectively, 
are 96 x 12 x 10 and 96 x 24 x 10 grids. The deposition of well groups of these 
two problems are given in Figs. 2 and 3. There is a fault between sub-domain 
4 and 5. 

Tables 1 and 2 give the experiment results of T\ and T 2 tested on IBM 
SP2, where WOF and WF indicate without fault and with fault, respectively. 
Figs. 4 and 5 are the calculation time of these two test problems. It can be 
seen that a superlinear speedup can be obtained on IBM SP2, if the number 
of CPU is not more than ten. However, the efficiency of PNUMSIP decreases 
seriously for T 2 , compared with T\ . 



Case 


CPU number 


Serial 


2 


4 


8 


WOF 


Wall-clock time (s) 


4180.3 


1674.9 


797.6 


464.4 


WOF 


Speed up 


1.000 


2.496 


5.241 


9.002 


WOF 


Efficiency 


1.000 


1.248 


1.310 


1.125 


Case 


CPU number 


Serial 


2 


4 


8 


WF 


Wall-clock time (s) 


4180.3 


1387.1 


810.6 


465.8 


WF 


Speed up 


1.000 


3.014 


5.158 


8.974 


WF 


Efficiency 


1.000 


1.507 


1.289 


1.122 



Table 1. Experiment results of Ti on IBM sp2. 



CPU number 


Serial 


2 


4 


8 


16 


Wall-clock time (s) 


15059.3 


6223.9 


4400.4 


2492.4 


5104.4 


Speed up 


1.000 


2.420 


3.422 


6.0422 


2.950 


Efficiency 


1.000 


1.210 


0.856 


0.755 


0.184 



Table 2. Experiment results of T 2 on IBM sp2. 

The reason for the superlinear speedup can be explained by the paralle- 
lization of I/O. For the serial software NUMSIP, all the well data are loaded 
by one CPU. But for the parallel software PNUMSIP, the well data are si- 
multaneously loaded by a number of CPUs. The total cache buffer size used 
by PNUMSIP is much greater than that used by NUMZIP. Therefore, the 
input time for PNUMSIP will be much shorter than that for NUMSIP. Table 
1 shows that the parallel efficiency is improved for a case that the boundaries 
between sub-domains are along natural faults, only if all the boundaries bet- 
ween sub-domains are not valid boundaries. The total length of boundaries 
between sub-domains of T 2 is generally greater than that of T\ and it results 
in more data communication and more modifications of linear equation co- 
efficient matrices, which leads to the increase of iteration times of the linear 
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equation solution. Therefore, the parallel efficiency of T 2 is less than that of 
Ti . The rapid increase of data communication and iteration calculation times 
can also explain why the parallel efficiency drops seriously, as the number of 
CPUs becomes too large. 



Case 


CPU number 


Serial 


2 


4 


8 


WOF 


Wall-clock time (s) 


1238.8 


1244.9 


741.6 


351.3 


WOF 


Speed up 


1.000 


0.995 


1.670 


3.526 


WOF 


Efficiency 


1.000 


0.498 


0.418 


0.441 



Table 3. Experiment results of Ti on Dawn lOOOA. 

Table 3 is the results of Ti without faults tested on Dawn lOOOA. Fig. 4 
is the comparison of the efficiency of PNUMSIP obtained on IBM SP2 and 
Dawn lOOOA with case Ti. It shows that the speedup and efficiency of PN- 
UMSIP running on IBM SP2 is much higher than those of PNUMSIP running 
on Dawn lOOOA with the same test problem. But the calculation time of PN- 
UMSIP running on Dawn lOOOA is less than that of PNUMSIP running on 
IBM SP2. Because the clock frequency of CPUs on Dawn lOOOA is faster 
than that of CPUs on IBM SP2, as seen in Section 4.1, the calculation speed 
of Dawn lOOOA is faster than that of IBM sp2, so the calculation time of 
Dawn lOOOA is less than that of IBM SP2. Compared with IBM SP2, the 
communication speed of Dawn lOOOA is too slow, as seen in Section 4.1. It 
affects the iteration calculation process of PNUMSIP. Because the non-block 
communication is used to transfer fast transient data and the data communi- 
cation delay is mainly caused by the low communication speed, the iteration 
solution values of neighboring sub-domains used for calculating new iteration 
solution values may not be updated in time. It slows down the iteration cal- 
culation speed and results in a low speedup and efficiency of parallel software. 



5 Conclusions 

The following conclusion can be drawn from the test results of PNUMSIP. 
The parallelization of benefits the efficiency improvement of parallel software 
of reservoir thermal recovery simulation. The data communication band- 
width of computers is an important factor which affects the speedup and 
efficiency of PNUMSIP. PNUMSIP is a stable and efficient parallel software 
of reservoir thermal recovery simulation. 
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Abstract 

In this paper a class of lattice Boltzmann models with the energy 
equation for simulating fluid thermodynamics are studied. The features 
of this class of models are that the discrete velocity set consists of 
multi-speed velocities and the internal energy of fluid is introduced 
by a multi-speed. Therefore, the energy term appears in the local 
equilibrium distribution functions of these models. Two examples are 
given in this paper. One is a ID model and the other is a 2D model, 
which are used to model a shock wave tube problem and the Benard 
convection problem, respectively. 



KEYWORDS: lattice Boltzmann model, energy equation, shock wave tube, 
Benard convection 



1 Introduction 

The lattice gas automata (LGA) method has attracted considerable attention 
during the last several years in both modeling physical systems and solving 
partial differential equations. In more recent years there has been a trend of 
using the lattice Boltzmann (LB) scheme instead of the lattice gas automata 
method. Unlike the LGA method in which one keeps track of each indivi- 
dual particle, in the lattice Boltzmann approach we are only interested in 
the particle distribution function. While retaining the advantages existing in 
the LGA, such as parallel implementation, simple programming, and clear 
physical images, the LB method is more efficient and accurate in computa- 
tion in an essentially noise-free manner 0 and has been applied in many 
areas such as flows in porous media and magneto-fluids. However, most of 
the models previously considered have not involved the energy equation. In 
this paper we discuss a class of lattice Boltzmann models with the energy 
equation by introducing multi-speed velocities to discrete velocity sets. Then 
particles can exchange internal energy during interactions. Therefore, the 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 208-^^^ 2000. 
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energy term appears in particle distribution functions (i.e., the local equi- 
librium distribution function). In §2, a general description of this class of 
models is given. In §3, two examples are given; one is a ID model and the 
other is a 2D model. Numerical experiments are made in §4. The two models 
are used to model a shock wave tube problem and the Benard convection pro- 
blem, respectively. The results show that the models can effectively simulate 
fluid thermodynamics. 



2 A Class of Lattice Boltzmann Models 

Divide the domain of a flowing field in a d-dimensional space into a regular 
lattice and the discrete velocity set V = {e^ : i = 1, ... ,6}. Let fi{x,t) be 
the particle distribution function with velocity at site x and time t. The 
model evolves according to the lattice Boltzmann equation 



fi{x + e„t+l) = fi{x,t) +fli, (2.1) 

and 

n, = --{Mx,t)-f!'^{x,t)), ( 2 . 2 ) 

T 

where Lli is called the collision term or the collision function and r is a 
relaxation factor (t > 1/2). According to the physical conservation law, fli 
must satisfy the conservation law of mass, momentum and energy 

h b 

= 0, ^L!,ei = 0, (2.3) 

i=l i=l i=l 

In (2.2), is the local equilibrium distribution function depending on the 
dynamic quantities of density, momentum, and energy at site x and time t. 
In the classical gas dynamics, is the Maxwell-Boltzmann distribution. As 
an approximation of the Maxwell-Boltzmann distribution, let 

= cq + ci(e* • u) + C2(e* • m)^ -|- cz^ + 04(0^ ■ uf" + . . . , (2.4) 

where u is the velocity of fluid and Cfc(fc = 0,l,...) are undetermined coeffi- 
cients, which depend on p and s that are the density and the specific internal 
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energy of the fluid, respectively, deflned as follows: 

P = E = E 
2 = 1 2=1 

/ou = E fi{x, t)e, = E t)e„ (2.5) 

2=1 2=1 

^ E - u)2 = i E 

i=l i=l 

Taking the zeroth, first, and second order moments of from (2.1) and using 
(2.3) and (2.5), we can obtain the mass, momentum, and energy equations 
of fluid 

|^+V(pu) = 0, ^^ + vn = 0, ^^+VQ = 0, (2.6) 

where II is the momentum-flux tensor of fluid, E = e+^u^ (the energy), and 
Q is the energy-flux vector, which are deflned as 

b 1 ^ 

Hck/3 — fi{x , Qol — 7 ^ ^ ^ (^0 i,^i')afi(,X,Lj. {2,.7^ 

i=l i=l 

To derive the Navier-Stokes equation, II must be isotropic. To obtain II 
must be combined with some concrete models. In the next section we present 
two example models; II is obtained approximately by using the Chapman- 
Enskog expansion j2| and properly selecting the local equilibrium distribution 
function like (2.4). 



3 Thermodynamic Models 

In the thermodynamic motion, fluid particles must exchange energy. Hence, 
to simulate the thermodynamic feature, modules of ei(i = 1,2, ... ,b) in the 
set V of LB models should not be the same. 

3.1 A ID Model 

For the problem of ID flows, the straight line is divided into a uniform lattice 
and the step size is unit one. Then the set V is chosen asV = {—2,— 1,0, 1,2}, 
which contains five velocities and three speeds, 0, 1, and 2. According to the 
speed cr((T = 0,1,2), and /E are rewritten as e^i and respectively. 
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Now, we choose as follows: 
fo’^ = P(1 - £) - 

fu = \P^ + \p{^ - e)(ei* • u) + \pv? - \p{eu ■ u)^ - \pu^, (3.1) 

/If = \P^ + ^P(l + 2£)(e2i • u) + + 3^p(e2i • u)^ + 

with z = 1, 2. Taking the Chapman-Enskog expansion of the particle dis- 
tribution function /^z by the local equilibrium distribution function we 
have 

+ + (3.2) 

where Likely, II and Q possess the same expansion after using 

(2.7): 



n = -h n(2) . . . , q^q(o) + qC) + qC) + (3.3) 



Under the zeroth order approximation, we have 

= 2pe + pu^ = p + pu^, QA'> = pEu + pu, (3.4) 



with p = 2pe (the pressure). Consequently, the Euler equation is obtained: 
dp ^ a(pu) a(pu) ^ d{p + pu^) d{pE) ^ d{pEu + pu) 

If taking the expansion to the first order term of II and Q, we can have the 
Navier-Stokes equation and the heat conduction equation 



9{pC . 

dt 

djpE) 

dt 



d(p-id) _ _dp I d (,,du\ 

~ dx dxyPdxL 






dx 

(pE 

dx 



(3.6) 



dx 



with p = 2pe{2r — 1) and k = 7ps{2T — l)/3, which are the viscosity and the 
heat conductivity respectively. 



3.2 A 2D Model 

As for a flow problem on the 2D plane, the field is divided into a uniform 
square mesh and the step size is unit one in both directions. Then the set V is 
chosen as U = {(0, 0), (1, 0), (v^, v^), (0, 1), (-1, ^2), (-1, 0), (-^2, -^2), 
(0,-1), (-\/2, — -\/2), (2,0), (0,2), (—2,0), (0,-2)}, which contains thirteen 
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velocities and four speeds, 0, 1, \/2, and 2. Now, the f^f’s are chosen as 

fii = ~ 1 ^^) +p(i~ ■ ") + 

+ ZP<^V2^ ■ u) + lp(^V2^ ■ - iiP^^ + yi^V2^ ■ ")^ 

f 2 i = - ^e) - piii - ^s)ie 2 i ■ u) - ^pu^ + ^p{e 2 i ■ u)^, 

(3.7) 

with i = 1,2, 3, 4. By the same way as above, taking the Chapman-Enskog 
expansion to the zeroth order term and the first order term, we can obtain 
the Euler and Navier-Stokes equations, respectively, 

dt(pua) + dp(pUaUp) = -daP, dt{pe) + dp{peup) = -dp{pup), (3.8) 



and 

dt(pua) + dp{pUaUp) = -daP + dp[iydp{pua) + Cda(pup)], 

dt{pe) + dp{peup) = -dp{pup) + da{KpdaS) + da[vupdp{pua) + C,da{pv?)\, 

(3.9) 

where v = e(2r — 1)/2, k, = e(2r— 1), and C = 2, which are the shear viscosity, 
heat conductivity, and bulk viscosity, respectively. 

3.3 Numerical Experiments 

To examine the models presented above, we apply them to two problems, 
which are a shock wave tube problem and the Benard convection problem. 

The ID model in §3.1 is applied to the shock wave problem with Sod’s 
initial conditions 0- 

{pl,ul,Pl) = (1.0, 0.0, 1.0), (pr,ur,pr) = (0.125,0.0,0.1). 

Fig. 1 gives the results at time step 30. The numerical results show that the 
LB model gives the correct position and the resolution of the shock wave and 
contact discontinuity are also good. 

The second model is used to the Benard convection problem which comes 
from a thermal convection experiment of a fluid that Benard did in 1900. 

A container is filled up with a fluid. The distance between the upper and 
the lower plates of the container is much smaller than the width and the 
length of the plates. Heating the lower plate continuously, the temperature 
gradient arises in the fluid. If the temperature difference between those two 
plates, AT = T; — T„, is not large, the fluid is macroscopically still in the 



A Class of Lattice Boltzmann Models 



213 







Figure 1: The shock wave tube problem. 

stationary state. Eventually, when AT exceeds a critical value, the stationary 
state of the fluid is broken down suddenly and is replaced by the convection 
state. The convection circuits, i.e., the Benard pattern, can be seen in the 
container. 

The mathematical description about the Benard convection problem is 
the Boussineq equations | 2 | 

V-u = 0, 

+ (u • V)u) = -Vp + Au + Raa9X, 

If + u • V6» = u • A + A6I. 

The boundary conditions are 

u = 0, 9 = 0, X 3 = 0, 1, 

where A = (0, 0, 1) is a upper vertical unit vector, 9 denotes the difference of 
temperature distribution between a certain initial state of the fluid and its 
perturbed state. P = v/k\s called the Prandtl number and Ra = 
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£j Rayleigh-Bernard Convection 

Velos ity expands: 20^ 0 200 w*100:0 OJ 200 TOP.T1 *100; 0 D _h 

BOTTOM_T2 *100:0 OJ 1000 J_0 *100:0 OJ 100 



T=B14 w=0,05 71=0,00100 72=0.00500 u_0=0,00000 




Figure 2: The Benard pattern. 

the Rayleigh number in which v, k, g, a, and d are the shear viscosity, the heat 
conductivity, the gravity acceleration, the coefficient of heat expansion, and 
the thickness of the fluid layer, respectively. This is a bifurcation problem. 
As the parameter Ra changes, the solution of the problem also changes. 
When the Rayleigh number Ra > 1708, the heat conduction of the fluid layer 
changes into the Benard convection. 

In our simulation, a rectangle container is chosen and it is divided into 
a square lattice of 80 x 20. The initial state of fluid in the container is 
given randomly and the boundary conditions are treated as in |S| • The result 
at time step 763 is shown in Fig. 2 and a very good convection pattern is 
obtained which is in a good agreement with the result of experiment. 

Acknowledgments. This work was supported in part by National Natural 
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Zhibo Ma Jianshi Zhu 



Abstract 

To compute the flow field in solid rocket ramjet (SRR) in which the 
chamber has a complex boundary, a block implicit algorithm (BIA) 
had been developed. The boundary conditions of three-dimensional 
steady-state Navier-Stokes (NS) equations were treated by modifying 
the discrete equations and the grids were generated through an alge- 
braic way. These methods have been put into practice and proved to 
be valid and efficient in the computation of flow field in the chamber. 
The technique developed here applies to similar problems in porous 
medium flows. 



KEYWORDS: rocket ramjet, numerical simulation, block implicit algorithm 

1 Introduction 

For the purpose of numerical study on the reaction flow field in SRR, the 
authors of this paper have developed a computer program in which the BIA 
has been adopted [2-4]. In this program all the control equations are in a 
cylindrical coordinate system and accordingly the domain of flow field is a 
simple column. 

In the real SRR or so-called ducted rocket, the dome plate always presents 
the shape of a ellipsoid and the fuel nozzles are inserted into the forepart of 
the chamber. Such designs of structure make the held boundary complica- 
ted. For the sake of accommodating these situations, one may think that the 
curvilinear coordinate system would be the first choice, but this may not be 
the best idea because it increases the computational cost. In fact, the most 
part of the long chamber is in the shape of a column and it is still advantage- 
ous to use the cylindrical coordinate system if the boundary conditions could 
be processed properly. In this paper the coordinates of grids are calculated 
with an algebraic method and the treatment of boundary conditions of NS 
equations is discussed corresponding to these specific grids. 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 216-^^^ 2000. 
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2 Generation of Grids 

A new program has been developed to generate automatically grids. In the 
program the grids are generated in such a manner that the resulting com- 
putational domain can cover the actual domain. The boundary grids at the 
forepart of the chamber present the profile of ladders. This treatment can 
lead to a high accuracy in numerical simulation. 

To eliminate the false numerical waves of pressure, a stagger-mesh has 
been adopted. In the stagger-mesh system, scalars are stored on the main 
grids and velocities are stored on the interface of the main grids. For three- 
dimensional problems, each control volume (or a main grid) has six interfaces. 
These interfaces can be divided into three kinds, each of which is perpendicu- 
lar to the axes of x, r, and 6, respectively. During the process of calculations, 
each interface is labeled by a predefined integral number, which is stored 
in arrays named judx(i,j,k), judr(i,j,k) and judth(i,j,k) corresponding to the 
three kinds of interfaces. The integral numbers representing the characteri- 
stics of interfaces are defined as follows: 



“0” means the interfaces are on the wall; 

“1” means the interfaces are in the inner field; 

“11” means the interfaces are at the front air inlets; 

“12” means the interfaces are at the back air inlets; 

“21” means the interfaces are at the first fuel nozzle; 

“22” means the interfaces are at the second fuel nozzle; 

“23” means the interfaces are at the third fuel nozzle. 



When calculating flow fields, we must follow the conservation of mass of 
air and fuel, so in the process of discretization, the areas of interfaces must 
meet the conditions 

^ Ax{i,j,k)cosAi+ ^ Ar{i,j,k)sinAi=AREAi, 

j udx {i ,j ,k) — ll judr{i,j,k) = ll 

Ax{i,j,k)cosA2+ Ar{i,j,k)sinA2 = AREA2, 

j udx {i ,j ,k) — 12 judr{i,j,k) — 12 



where AREAi and AREA 2 denote the cross-section areas of front and back 
air inlets, respectively. The grids on the fuel nozzles occur in the same way. 
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3 Outline of Block Implicit Algorithms 

The steady control equation on a finite volume can be generally described in 
an integral form 

4>) ■ dS = j Srj,dV, 

AS 

where </> denotes velocities or scalars. 

When using a block implicit algorithm to solve the steady-state NS equa- 
tions, the symmetrically coupled Gauss-Seidel (SCGS) technique is adopted 
in sweeping through the grids to smooth u, v, w, and p. For the grid with 
indices (i,j,k), the variables to be smoothed can be described as a column 
vector [1] 

X = [xi X 2 Xs Xi X^ Xq X-j], 
and X is seperated into two parts as 

^new ~ ^old T 

when it is updated. The corresponding discrete equations of AX are 

AAX = B. (3.1) 

It can be easily solved through a direct method. 

4 Treatment of Boundary Conditions 

To solve the NS equations of compressible flow in the chamber of SRR, com- 
putational conditions must be given by: (a) velocities and stagnation tempe- 
ratures of air at the exit of inlets and those of fuel at the exit of fuel nozzles 
and (b) pressure at the exit of the chamber. 

Generally, the mass flows of air and fuel are design parameters, so their 
velocities are unknown and must be recomputed continually to meet the given 
flows according to the densities at the air inlets and fuel nozzles. The density 
is obtained from the ideal gas state equation 

p = p/RT, 

where the temperature is computed from the thermal enthalpy H and special 
heat Cp such as 

T = H/Cp. 
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Like the velocities on the entrance of the chamber, the pressure at the exit 
of the chamber (or the so-called Pout) is also an unknown quantity. The 
Pout must be continually updated along with the iteration cycles on u, v, 
w, and p to meet the real boundary conditions. In other words, Pout is 
regarded as a constant in each cycle mentioned above and finally determined 
by other predefined parameters and physical states such as the stagnation 
pressures of air and fuel, the throat area of the chamber nozzle, and the 
combustion efficiency. To avoid divergence in solving the NS equations using 
BIA, care must be taken to treat the pressure, density, and velocity at the 
boundaries. With regard to the problem about the SRR chamber, five kinds 
of boundaries are encountered. Except at the exit of the chamber, a zero- 
derivative condition is used on the variables p and p at the boundary faces. 
At the exit of the chamber, pressure is given by Pout as 

p{imax - 1 , j, fc) = Pout, 

and the density is obtained from the ideal gas state equation. 

The velocities can be computed through their discrete equations described 
as follows: 

(a) At the entrance of air and fuel 

At the entrance the velocities can be regarded as constants within each 
iteration cycle on u, v, w, and p, but they can not be allowed to be equal to 
the constants directly or it leads to divergence. In fact, the velocities on the 
boundaries are allowed to fluctuate, but finally they converge to the practical 
values. For example, at the entrance of air, if the three velocity components 
are Uair, Vair, and Wair, then their discrete equations corresponding to equa- 
tion ( 3 . 1 ) are revised as 

k) + airAp{i,j, k) = an[Uair ~ u{i, j,k)oid], 
a 33 Av{i,j,k) + a 37 Ap{i,j,k) = 033 [Kir - k)oid], 

a55Aw{i,j, k) + a5rAp{i,j, k) = a^^lWatr ~ k)oid\- 

(b) At the solid wall face 

As the non-slide condition is adopted, the speed of fluid is zero on the 
wall, so the discrete equations of u{i,j, k), v{i,j, k), and w{i,j, k) are 

aiiAu{i,j,k) + ai7Ap{i,j,k) = -auu{i, j, k)oid, 

033 Au(i, j, k) + a3rAp{i, j, k) = -033^(1, j, k)oid, 
ar, 5 Aw{i,j,k) + a 57 Ap{i,j,k) = j, k)oid- 
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(c) At the symmetric plane 

In this situation, w{i,j, k) or w{i,j, A: + 1) become zero, so their discrete 
equations are 

a55Aw{i,j, k) + 057 Ap(i, j, k) = k)oid, 

aeeAw{i,j, k + l) + aerAp(iJ, k) = -aeew^i, j, k + l)oid- 

(d) At the axis of the chamber 

At the first layer of grids near the axis of the chamber, the SCGS operator 
can not be applied, so the velocities are approximated by using those of the 
second layer of grids: 

u{i, 1, k) = u(i, 2, k), 

v(i, l,k) = cos(0.5(0(i,j, k) + e(i,j,k+ l)))vo(f), 
w{i, 1, k) = sin{9{i,j, k))vo{i), 

where 

vo{i) = |i?o(f)|- 

In the above formula, voii) is the velocity component projected in the (r,9) 
plane of the fluid, the position of which is on the axis of chamber. 

To keep the conservation of mass flow, the variables of v{i, 2, k) must be 
obtained from the mass continuity equation. 

(e) At the exit of chamber 

The flow field can be regarded as fully developed at the exit of the cham- 
ber; therefore, the derivatives of velocities along with the chamber axis are 
considered to be zero. They are described as 

u{imax,j, k) = u{imax — 1, j, k), v{imax,j, k) = v{imax — 1, j, k), 
w{imax, j, k) = w{imax — 1, j, k). 

5 Numerical Example 

To verify the quality of the treatment of boundary conditions, we have calcu- 
lated a problem of a model ramjet that has two front air inlets and two back 
fuel nozzles. The structure is symmetrical in (r, 0) plane. The computation 
domain of 0 is 0° ~ 90°. The mesh contains 34 x 17 x 11 grid nodes. To 
solve the compressible problem (neglecting the chemical reaction), it spends 
PC586 about 3.0 minutes when the norm of residuals converges to a level less 
than 10“®. The normalized error of flow at the exit of chamber is less than 
0.3%. 
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6 Conclusions 

The numerical simulation on reaction flow in SRR is a burdensome task that 
spends much computation time, so it is important to develop a numerical 
method to solve efficiently this practical engineering problem. Through the 
approach described above, the grids are generated rapidly by an algebraic 
way. With the special treatment of boundary conditions, the NS equations 
can be solved quickly in the cylindrical coordinate system and the conserva- 
tion of mass flow can be well observed even for the chamber with a large L/D 
ratio. The computer program developed under the guidance of these ideas 
plays a useful role in the research and development on SRR. 
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Abstract 

Some mixed finite element methods for a non-Newtonian flow pro- 
blem are discussed in this pape. A new variational formulation is pro- 
posed to handle the Newtonian-viscosity dependent problem arising 
from a three-field version of the White-Metzner model. It allows us 
to derive Newtonian-viscosity independent error bounds and to facili- 
tate the choice of stable finite element spaces. Moreover, a continuous 
stress approximation can be obtained without any extra effort. The 
enhanced-strain-oriented quadrilateral Wilson-Pi element is used for 
approximating the two-field version with or without a modification on 
the variational formulation. Stability is justified and quasi-optimal 
error bounds are presented. 



KEYWORDS: mixed finite element, Wilson element enhanced strain method, 
Korn inequality 

1 Introduction 

Non-Newtonian flows are extensively involved in a number of problems such 
as the production of oil and gas from underground reservoirs. Such kind 
of problem has an intrinsic, two-scale coupling which poses some challenges 
in the numerical simulation, and the difficulty emanates from the nonlinear 
viscosity. Numerical simulation of many problems in this field is still open. 
In this paper we consider a simplified model, which possesses some intrinsic 
characters of the original problem. It is believed that an effective algorithm 
for the original problem should be effective for the simplified model considered 
herein. 

Our assumptions for the simplification are the following: only steady-state 
flows are take into account, inertial effects are ignored, the Deborah number 
is small, and, consequently, we only consider quasi-Newtonian fluids. 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 222-^^^ 2000. 

(c) Springer-Verlag Berlin Heidelberg 2000 



Stable Conforming and Nonconforming Finite Element 



223 



The line of the paper follows: in §2, the White-Metzner model is presen- 
ted and a new variational principle is proposed to deal with the Newtonian- 
viscosity-dependent problem and in §3, the enhanced-strain-oriented quadri- 
lateral Wilson-Pi element is presented and analyzed. 

2 Model of White-Metzener and Newtonian- 
Viscosity Dependent Problem 

We give a brief description of a White-Metzner model for the viscoelastic 
flow. We refer to |Z] for a more general description. 

Let n be a bounded convex polygonal in TZ^{N = 2, 3) with the Lipschitz 
boundary L. TZ^ is equipped with Cartesian coordinates Xi,i = 1, . . . , N. For 
a function u, is written as and the Einstein convention for summation 
is used. 

For a scalar function p, the gradient of p is a vector Vp, and ( Vp)i = p^i. If 
q is another scalar function, we denote (p, q) = pq. For a vector function u, 
the gradient of u is a tensor Vu, (Vu)^ = u^j, divu = u-V = For 

a tensor function u, V • cr is a vector function, (V • a)i = aijj, a : t = UijTij, 
|crp = a : a, and (cr, r) = : t. 

To describe the flow, we use the pressure p (scalar), the velocity vector 
u, and the total stress tensor atot- Su = |(Vu -h V^u) is the rate of strain 
tensor, and £’n(u) = is the second invariant of Evl. 

A White-Metzner type model is described by the constitutive equations: 

CTtot = cr -I- (Tat - pi, {l)ij = Sij, (Jn = 2pE\i, a = 2p(£'n(u))fu, 

where p is the Newtonian part of the viscosity and 77 is the viscosity function 
for the viscoelastic part. The velocity u must satisfy the incompressible 
condition 

divu = 0. 

In this paper we consider the stationary creeping flow m- The fluid is 
subject to a density of force f. Then the momentum equation is 

-divCTtot = f. 

Two classical laws for rj are the Power law and Carreau’s law CH: 

(r — 2) 

Power law: rjp{z) = ^gz 2 , r > 1, p > 0; 

Carreau’s law: Pc(z) = (ryo - ?7oo)(l + Az)“, 0 < poo < Po, > 1. 
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Sobolev spaces are needed. T = [L’’ (fl)] ^ ^ = {r = (ry) | nj = 

Tji , T,j € i,j = with the norm ||t||t = (/nlrr')^. (•,•) 

denotes the inner product of X = [Wq’^ {^)]^ , M = Lq{^) = {q £ u' [M) \ 
f^g = 0}. X and M are equipped with the norm ||v||x = (/q |fv|’’)^ and 
IkllM = (/n respectively. It is easy to see || • ||x is an equivalent norm 

over X. We also denote T' and X' the dual space of T and X, respectively, 
and (, ) the dual multiple between X and X'. For 1 < r < 2, we must modify 
the definition of X and define Xi = X fl [Ffg but we still denote it as 

X for simplicity . 

Some progress has been achieved in the finite element approximation of 
this problem. In 1992, Baranger, et al. |2j gave the first finite element appro- 
ximation based upon a special case of the following variational formulation 
with a = 1. However, abstract error bounds they obtained are /i-dependent; 
i.e, the error bounds are deteriorated as the Newtonian viscosity approaches 
zero. Furthermore, no finite element space pair is available. Meanwhile, the 
continuous approximation for the extra stress is widely used in engineer lite- 
ratures; however, a fairly large finite element space is needed to achieve this 
goal (13 ED) that would cause an extra cost and lose the accuracy. Recently, 
the so-called EVSS and its modification [B| are proposed to attack this pro- 
blem, but it needs an extra variable that would increase the cost and even 
seriously it would lead to unsymmetric algebraic systems. 

Below a new variational formulation (P ^]) is proposed to solve the 
above problems. Some operators are needed to introduce our method: 

B:X^X', R(u) = 2 t 7 (£:u(u))£:u, 

: T X X X M ^ T' X X' X M', 

Z : T X X X M T' X X' X M', 
x = (a,u,p), y=(r,v,q), 

CHa(x),y) = a{A{a),T) - a{T,£\i) + a{a,£\i) -I- (1 - a){B{\i),£v) 
-|-2^(£iu,£’v) - (p, divv) -I- (g,divu), a£ [0,1], 

< l,y >=< f,v > . 



We define the problem: Problem PL. Find x G T x X x M such that 
(Ff„(x),y)=<l,y> VyGTxXxM, VaG [0,1]. 



It can be cast into the saddle point problem: Problem PL. Find x G 



Stable Conforming and Nonconforming Finite Element 



225 



T X X X M such that 

a{A{a),r) — a{T,£\i) = 0 Vr G T, 
a(cr,fv) + (1 — a)(i?(u),fv) + 2^(fu,fv) 

— (p, divv) =< f,v > Vv G X, 

(divu,( 7 ) = 0 yq&M, aG[0,l]. 

When a equals 1 or 0, Problem Tf degenerates to the problem which 
has been discussed in B or the model discussed in the next section, 
respectively. We only consider the case a G (0, 1) in this section. 

Based on the classic theory of a mixed variational problem (see |5| for the 
linear case and m the for nonlinear case), the basic vehicle for the analysis 
of Problem % is the following two inequalities, which concern the monotony 
and continuity properties of A(a). 

Though the explicit formula for A(a) is not available when 77 = 77 ^, we 
are still able to prove the monotony and continuity properties of A(a) in this 
case. 



Theorem 2.1 m) There exists a constant C independent of a and r such 
that, for any i5 > 0, 

(A{a) - A{r),a - t) > C\a - rp+‘^(l + |cr| + 

|A(a) - A(r)\ < C\a - rr^(l + |a| + \r\r'-^+^. 

The proof of these inequalities is tremendously long since in this case 
the explicit form of A(a) is unknown. The baisc idea in our proof is the 
prolongation trick and the application of the known results for A(cr) when 
11 = Vp- 

We assume that the triangulation Ch is a regular partition of fl] the quasi- 
uniformity of Ch is not necessary unless we state it clearly. Let T/j, X/j, and 
Mh be the finite element space: Th G T, X/j G X, and Mh G M. 

Problem Hh- Find Xh = (ah,Uh,Ph) G T;i x X^^ x Mh such that 

{'Ha{y^h),y) =< l,y > Vy g T,, x X?, x M, Va G (0,1). 



The well-posedness of Problem %h depends heavily on the B-B inequality 



3 Ph(r) > 0, iiif sup > Ph{r). 



gGMfevfcGXfc ||V/t||x|| 



M 



Choosing X/j and Mh satisfy the above B-B inequality is a delicate task, 
though there are many examples for r = 2 (see uni and 0 for a review). 
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Nevertheless, there exist far fewer examples for the case r ^ 2. In a 
general criterion is given and exploited to prove that almost all the stable 
pairs for the case r = 2 is also stable for the case r ^ 2. 

Remark 2.1 Note that the B-B inequality for (a, u) and (ah, u^) is no longer 
needed either for Problem T-L or for Problem 'Hh- An enhanced K -ellipticity 
is introduced in a natural, reasonable and unifying way. It is known that 
there are many finite element space pairs satisfying the B-B inequality eit- 
her for (a,u) or for (u,p), but they do not satisfy the two B-B inequalities 
simultaneously m 

To derive error bounds, we adopt the abstract quasi-norm introduced in 
0, which is very useful. Let (cr, u) G T x X be the solution of Problem (H). 
Then for (r, v) G T x X, we define the quasi-norms 

l^lr' = In p' = max(2,r'), 

\^^\r = In |fvp(6i-f |£:u| -h \£v\Y'-'^, p = max(2,r). 

If the usual approximation properties for the finite element spaces Th, ^h, 
and Mh is assumed and the B-B inequality hlods, then we have the following 
error bounds. 

Define k G [r, r-|-(2 — r)0] for r G (1,2] and k G [r, r-l- (r — 2)0] for r G (1, 2] 
below. 

Theorem 2.2 Let a G (0, 1) and (ah,Uh) € Th x Vh be the unique solution 
of Problem Hh- 

(i) If r ^ [1,2], we have 

Ik-cr/illx < ||u-u;,||i^r-h ||u-u,,||x2 < C/lT. 

In particular, when 9=1, we even have |]cr — atWh'^ < C\h^ . 

(a) If r € [2,oo), we have 

\Y - ah\\T < Ch^ , ||u - u/i|]xr < 

When 9=1, we even have 

jju - U/,||X 2 < \\a - ahWh-i <Ch^ . 

(Hi) Furthermore, if the mesh is quasi-uniformity, then 
Wp-PhWl-' rG (1,2], 

C = min 

\\p-Ph\\L-' <Ch^, rG[2,oo). 



( 2 . 1 ) 



Stable Conforming and Nonconforming Finite Element 



227 



Remark 2.2 When r G (1)2], the error bound (2.1) for the pressure is de- 
teriorated. We find that the deterioration happens only in the limiting cases: 
(1) N = 2, k = 1, (2) N = 3, k = 1, (3)N = 3, fc = 2(r e (1, f )). If we 
assume p, = these three cases, the accuracy can he recovered. 

In fact, this kind of assumption on p, is realistic when p is very small. Re- 
calling that in this case the proper Sobolev space for the pressure is 
we only need to derive error hounds in Lg(ll). Since the norm on io(^) 
is weaker than that in Lq (12) in the present situation, we can expect to get 
p-independent error bounds; it is just the case. 

Corollary 2.1 With the same assumptions as in case (Hi) of Theorem 2.2, 
we have 

\\p-Ph\\L^<Ch^‘'^-^\ 0 = 0, \\p-Ph\\L^<Ch\ 9 = 1. 

Exploiting the special structure of our quasi-norm, we can get some error 
bounds with respect to stronger norm (see ini)- 

3 Wilson Element for Non-newtonian Flow 

In 1995, Reddy and Simo HH investigated the stability property of the en- 
hanced strain method, which is identical to the Wilson element method under 
uniform square meshes. More recently, Zhang has established the stabi- 
lity of the Wilson element and its variants m for the incompressible elastic 
problem over an arbitrary quadrilateral mesh with a minor modification of 
the original variational formulation. In H2!. the enhanced strain oriented ele- 
ment has been subject to a severe numerical test, which indicated that such 
kind of element performs very well even on relatively coarse mesh. Therefore, 
it is worthwhile to study the Wilson element in the present context. 

In this section, we study the convergence behavior of the enhanced strain 
oriented Wilson element for Problem R with a = 0, with or without a modi- 
fication on the variational formulation. We show that the same accuracy can 
be achieved compared to the conforming bilinear element for the variational 
formulation with a minor modification, and under the bi-section condition 
m the same assertion holds for the variational formulation without a mo- 
dification. 

The Wilson nonconforming finite element space |23| is 

{v e [L\n)f \ir = voFKG P{k) WK G C^, 
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where [Qi{K)]'^ C P{K) C [^ 2 ^]^. We write P{K) = [Qi{k)]^ + B{K), 
where B(i4T) contains the nonconforming part: 

B(i^) = Span(C^-l,??2_i)2. 



We denote the Wilson nonconforming element space as X^, and the Pi 
conforming element space as Mh, which is used for approximating the velocity 
and pressure, respectively. X/, can be decomposed into the conforming part 
"V h and the nonconforming part B/i: 

v„= {v^ e X I o Fk e lQi{k)]^ vx e C4, 

Bh= K G \ v'^ = v^oFKeBK VX G Ch}. 

Define the semi-norm 



l|v|U = (E Mm.ic)"=(E \^l\lr,K+ E 

KeCh K&Ch KeCh 

We list some geometric properties of an arbitrary quadrilateral mesh m- 



— ao + -I- 02?? -I- ai2^T], 2/^ — &o + + ^2?? + bi2^rj. 



DFKk^ij) 



Ol -I- 012?? 02 -I- Oi2^ 

bi + bi2rj 62 -|- b\2^ 



and the Jacobian of Fk is Jk{^, v) = det(DFK) = Jq + where 

= oi&2 — 0261, = 01612 — 01261, J2 = 012&1 — 02612- Denote the 

inverse of Fk by F^^. Then 



{DFk)-H^,v) 



1 



62 + 612C — 02 — Oi2^ 

-61-612?? ai+ai2V 



With these notation, for any v G we define fy, as follows: 

Jk^= ^(0.0)11 - ^(0,0)| = 62 || - 6i||, 

Jk^v= -^(O.O)i + ^(0,0)i| = - 02 i| + oiff. 

We define the modified divergence operator by 



divv = 



dvi 

dx 



dv2 



[5(u),£:v];,= {A{u‘^),£v'^)h + (^(u''),£:v^)),i, 

[divu,p]/, = (divu=,p) -h (divu^p)^,, [f,v];i = (f,v‘=). 



and 
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Problem Hh- Find (uh,Ph) G x Mh such that 

[A{vih),£yr]h - [divv, ph]h = [f,v]/t Vv e Xh, 

[divu?i,9]/i = 0 Mq&Mh. 

To get the well-posedness of Problem TLh, we need the following lemmas. 

Lemma 3.1 m) 

+ ||V^|U) < \\v\\h < C' 2 (|v‘=|i,r + \W^\\h) Vv e Xh, 

\W%,r<Ch\\v% VVGX^. 

This lemma can be proved by a combination of scaling trick and the 
equivalent norm theorem in finite dimensional spaces. 

Lemma 3.2 (cni) The discrete Korn inequality 

l|v|U<C||fv||o,. VvgX;,, 



and the B-B inequality hold 



3f3h{r) > 0 , 



. n divv,g 

mf sup — 

gGMhvGXh l|v|U|lg||M 



> Ph{r). 



The proof of this B-B inequality is long and technical; we refer to jl h) for 
more details. 



Theorem 3.1 If {\ih,ph) G X^ x is the unique solution of{Hif), then 
i) for r ^ [1,2], we have 

||u - u/,||,j < , \\p-ph\\M <Ch'^~^, 

a) for r G [2,oo), we have 

\\u-Uh\\h<Ch^, \\p-ph\\M <Ch"^. 

When 9=1, we even have 

||u-u,i||i_2 < Ch'i. 



Corollary 3.1 The nonconforming part of the approximate solution Uh G 
Xh tends to zero with the estimates 

ll^^llo.r + ^||u^||/t < Ch^~^^ Vr G (1,2], 

llu^llo.r + h\\ui\\h < Ch^+'^ Vr G (2,oo). 

Furthermore, 

||u - < C/i^ VrG (1,2], \\u~ul\\h<Ch^ VrG(2, oo). 
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As for the variational formulation without a modification, we only need 
the hi-section condition (see my- 

Bisection condition. The distance dK between the midpoints of two 
diagonals of K G Ch is of order 0{h\) uniformly for all elements AT as — >■ 0. 
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Abstract 

A new sequential solution method with selectively chosen primary 
variables is developed for modeling the enhanced oil recovery proces- 
ses. A mixed finite element method (MFEM) is used to solve one 
phase pressure and pseudo total-volumetric velocity simultaneously. 
An Eulerian-Lagrangian localized adjoint method (ELLAM) is used to 
solve each transport equations. Computational results for two- and 
three-phase multi-component fluid flow occurring in enhanced oil reco- 
very processes are presented, which show the strength of the method. 



KEYWORDS: compositional flow, mixed finite element method, Eulerian- 
Lagrangian localized adjoint method 

1 Introduction 

The objective of reservoir simulation is to understand the complex chemical, 
physical, and fluid flow processes occurring in a petroleum reservoir suffi- 
ciently well to be able to optimize hydrocarbon recovery. To do this, one 
must be able to predict the reservoir performance under various exploita- 
tion schemes. In many enhanced oil recovery processes, some chemicals are 
injected into the reservoir to mobilize the residual oil trapped in the re- 
servoir rocks. Thus, two or more fluids can flow in an immiscible fashion 
at certain times and in a miscible manner at other times. Different com- 
positional models have been developed to model accurately these processes 
by characterizing the composition of reservoir fluid using a finite number of 
components EJ ^ , leading to strongly coupled systems of nonlinear par- 
tial differential equations (PDEs) and constraining equations. The governing 
PDEs are basically of an advection/diffusion type with advection being the 
dominant process. Although diffusion or dispersion is a small phenomenon 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 232-^^^ 2000. 

(c) Springer-Verlag Berlin Heidelberg 2000 



Numerical Simulation of Compositional Fluid Flow 



233 



relative to advection, it is important for miscible flow regimes and may at 
times describe important capillary pressure effects in the immiscible flows. 
The constraining equations provide further information on the distribution 
of mass for each individual component in different phases. In general, they 
are strongly nonlinear, implicit functions of pressure and mole fractions for 
each component in different phases. Consequently, extremely large number 
of PDEs and unknown variables are involved. 

The combination of strong nonlinearities and close couplings between the 
equations and constraints cause severe numerical difficulties in the solution 
of the systems. Due to the enormous size of many field-scale applications, 
these equations cannot be solved in a fully-coupled, fully-implicit fashion, 
and some linearization techniques must be used to obtain the numerical so- 
lution. However, a blind linearization with little regard to the properties of 
the equations or the solutions can result in extremely large, ill-conditioned, 
nonlinear systems; the accurate solution of these equations can be extremely 
difficult and expensive. Choices of implicitness and decoupling of the equati- 
ons must be analyzed and treated with great care for these difficult problems. 
Secondly, quite large grid-spacings must be used in the discretization of PDEs 
due to the enormous size and complexities of the problems. The use of large 
grid-spacings in space-centered finite difference methods (EDMs) and finite 
element methods (FEMs) often yield numerical solutions with excessive os- 
cillations. On the other hand, upwind EDM and Petrov-Galerkin FEM can 
produce numerical solutions with severe numerical dispersion, and spurious 
effects related to the orientation of the grid I3IS1. 

The ELL AM was originally introduced by Celia, Russell, Herrera, and 
Ewing for the solution of (one-dimensional constant-coefficient) advection- 
diffusion PDEs in a mass conservative manner [^. It overcomes the prin- 
cipal shortcomings of many previous characteristic methods while maintai- 
ning their numerical advantages. Subsequently, ELL AM schemes have been 
successfully applied to linear transport PDEs, multi-component fluid flow 
problems in compressible media with wells, and many other problems (see 
uni uni and the references therein) . In this paper we develop a numerical me- 
thod for compositional flow by utilizing the basic properties of the flow more 
effectively. By choosing the primary variables properly, we ease the coupling 
and nonlinearity of the system. We then apply a sequential procedure to 
linearize and decouple the coupled system, and use an ELLAM scheme to 
solve each linearized transport equation and an MFEM to solve one phase 
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pressure and pseudo total-volumetric velocity simultaneously. Numerical re- 
sults for two- and three-phase multi-component flow problems are presented, 
which show the strength of the method. 



(i) The reservoir temperature is constant. 

(ii) There is no mass exchange between the water phase and the oil and 
gas phases. 

(iii) The water and rock are incompressible, which implies that the density 
Pw of the water phase is constant and that the porosity (/> of the rock 
is only a function of spatial variables. 

(iv) The viscosity of the water phase is constant. 

Since there exists mass exchange between phases, mass is not conserved 
within each phase. Nevertheless, the total mass of each component is conser- 
ved, and the corresponding mass conservation can be described by 



Here N is the number of hydrocarbon components. The superscript and the 
subscript refer to the component and phase indices, respectively. In Eqs. (2.1) 
and (2.2), the nomenclature is such that (or m* or rrig) represents the 
moles of the corresponding component per pore volume; rrd = m* -I- m* is the 
overall moles of component i; Mq and Ug are the phase velocities given by 
Eqs. (2.4) below; Vm, Vo and Vg are the ratios of the water, oil and gas phase 
volume to the pore volume, respectively; g*" and g* are the mass flow rate of 
water and the ith component. Under the assumption that the pore volume 
of porous media is fully filled with fluids, the following volumetric constraint 
holds ^ 



2 A Compositional Model 

A compositional model is presented under the following assumptions IDEIHI: 




( 2 . 1 ) 




= q\ 1=1,.. .,N. (2.2) 



Vt = Vn, + Vo + Vg = 1 . 



( 2 . 3 ) 
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The following multi-phase Darcy’s law gives the relations between the 
phase velocities u^, and Ug and the phase pressures p^, Po and Pg: 



Uc = -K 



i^ra 

l^OL 



(vpa - a = w,o,g. 



(2.4) 



In Eq. (2.4), K is the absolute permeability of rock which may be a full tensor 
for anisotropic and heterogeneous porous media, krw, kro and krg are the 
relative permeabilities of rock that are functions of saturations, pu,, po and 
Pg (or Pm, Po and pg) are the viscosities (densities) of the fluid where pw (or 
Pm) is constant and po and pg (or po and pg) are the functions of pressure and 
composition of hydrocarbon. Am = krw/ Pw, Ao = kro/po and Xg = krg/ Pg 
are the phase mobilities. Qc is gravity acceleration. 

In addition, the nonlinear PDEs presented above are coupled with a com- 
plex phase package that describes the relative phase equilibrium for a given 
pressure-volume-temperature through the equation of state (e.g., the Peng- 
Robinson equation of state). 

Eqs. (2.1)-(2.4) form a strongly coupled system of nonlinear PDEs and 
constraints. By the Gibbs phase rule one concludes that the system is uni- 
quely determined by iV -|- 2 extensive variables, which are called primary 
variables |p. Other variables can be expressed as functions of these varia- 
bles. To solve this system efficiently, one should choose the primary variables 
carefully based on all the inherent physical and computational properties. 
We choose the first iV -|- 1 primary variables as in many papers PEI El in, 
which are the pressure of oil phase p, the overall mass per pore volume of the 
hydrocarbon components mX (* = !,..., iV), and the water saturation Sm- 



3 A Sequential Solution Method 

3.1 An MFEM for the Pressure and Total Velocity 

Notice that vt = vt{p, Sw,'m^ , ■ ■ ■ , ) is a function of p, Sm , and m^, ... ,m^ 

If one differentiates the constraint equation (2.3) with respect to t and repla- 
ces dsw/dt and dmX /dt by Eqs. (2.1) and (2.2) incorporated with the Darcy’s 
law (2.4), one obtains the following pressure equation PEH: 







I mlkro 
y VoPo 



'mfgkrg 

VgPg 



V-f 





P= rp, 



(3.1) 
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where p is the pressure of the oil phase, (dr = —(j){dvT/dp) is the total fluid 
compressibility that is positive because dvr/dp is negative, and Vp is the 
volumetric discrepancy error. 

Eq. (3.1) is a parabolic PDE with respect to the pressure p and can be 
solved by EDMs, FEMs, or flnite volume methods (FVMs). After the nume- 
rical solution p^ is obtained, one computes the numerical phase velocities u^, 
Mg and Ug by substituting into Eq. (2.4) and using the capillary pressure 
curves. However, when the subsurface geology is strongly heterogeneous, the 
absolute permeability K of the rock can be very rough. In this case the exact 
solution p of Eq. (3.1) is not necessarily smooth and so the numerical solu- 
tion p^ might not be accurate. As a result, the numerical Darcy’s velocities 
Vlw: Ug obtained from Eq. (2.4) by numerically differentiating p^ and 

multiplying p^ by a rough coefficient k/p are even less accurate. This, in 
turn, affects the accuracy of the numerical approximations to other primary 
variables when uj) and are put into Eqs. (2.1) and (2.2). 

While the pressure p may be rough, the total Darcy’s velocity u = u„„ + 
Vlo + Mg is usually smooth. Moreover, it is the phase velocities instead of the 
pressure that are needed in Eqs. (2.1) and (2.2). Hence, we adopt an MFEM 
to solve the following system of first-order PDFs for the total Darcy’s velocity 
u and the pressure p simultaneously f2lE|: 

+ V -u = Rp, u + \t'KVp= Ru, (3.2) 

at — 

where Rp and Ru involve the temporal and spatial derivatives of other pri- 
mary unknowns, but not u and p. At = A^j -k Ao -I- Xg is the total mobility. 
The operator d/dt is defined by 




After the total Darcy’s velocity u is obtained from equation (3.2), the phase 
velocities u„j, and Ug can be computed by (with a =water, oil, and gas) 



= /o 






V(Pcgo - Pcao) ~ {Pj ~ Pa)9c^ Z 



(3.4) 



where fa = Aq,/At is the fractional flow function and Pcjo (or Pcao) is the 
capillary pressure between phase j (or phase a) and the oil phase. 

An MFEM for Eqs. (3.2) can be written as follows: And (u,p) £ V^, x Wh, 
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for any t G [0,T], such that 

(f,c) + (v-M,c) = VC GW., tG[0,T], 

((AtK)-Im,^) - (p,v-^) = (^Ruv,u) V^G V., tG [0,T]. 

Here V. and W. are the well-known Raviart-Thomas spaces 0 LLUj ' 

3.2 An ELL AM for the Transport Eqnations 

Substituting the phase velocities u^, u^, and Ug obtained from Eq. (3.4) into 
Eq. (2.1) and using the fact that the water phase is incompressible, we rewrite 
Eq. (2.1) as follows: 

+ V • (m/»(s^)) - V • (d“Vs„) = R’", (3.6) 

with given by 

N 

R’" = V • (^ D*Vm*) -k q'^. (3.7) 

Likewise, we can rewrite Eq. (2.2) as 

</>^ + V • (zW) - V • (d w) = * = l,...,iV, (3.8) 

where E* and R\^ are as follows: 




N 

«™ = ft + V-( ^ DWm^). (3.10) 

The analysis in PII21 shows that the advection part of Eq. (3.6) is least 
coupled with system (3.8) and the coupling among the equations in system 
(3.8) is relatively weak. Therefore, in Eqs. (3.6) and (3.8) we move the off- 
diagonal diffusion terms to the right-hand side to decouple these equations. 
In the next subsection, we present a sequential solution method to solve the 
nonlinear advection-diffusion equation (3.6) for the water saturation and 
to solve the decoupled and linearized advection-diffusion equations (3.8) for 
the total moles m® of each component. 

Notice that E® is in fact the barycentric velocity of the Rh component 
in the multi-phase and multicomponent mixture. It depends on both the 
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process of fluid flow and the phase equilibrium. The physical interpretation 
of Eqs. (3.8) is that a particle of component j in a multi-phase fluid is traveling 
with the velocity W while simultaneously diffusing at the rate controlled by 
Dh The coefficients of diffusion terms, and D*, come from the capillary 
pressure effects. In many cases, capillary pressure effects are small and the 
advection terms dominate these equations. Consequently, the exact solutions 
of the equations are usually smooth outside some small regions within which 
the solutions have sharp fronts that need to be resolved accurately without 
oscillations or numerical dispersions. Because of the numerical advantages 
of ELLAM approach, we solve each linearized transport equation in (3.8) by 
using an ELLAM scheme 



/ (j)rn^{-,tj)w{-,tj)dx + At / Vw{-,tj)'D’‘ym^{-,tj)dx 

Jn Jn 

= / (j)m\-,tj-i)w{-,t'^_-^)dx + At / Rl,^w{-,tj)dx, 



(3.11) 



where w{x,tj'_i) = limj_^j+ w{x,t). We refer readers to dnng for discus- 
sions on implementational issues. 



3.3 A Sequential Solution Procednre 

The system of Eqs. (3.2), (3.6), and (3.8) is a large, time-dependent, strongly 
coupled system of nonlinear PDEs coupled with the nonlinear phase equili- 
brium constraints. We present a time-marching, sequential-implicit algorithm 
to decouple the computations of phase equilibrium from the solutions of the 
system at each time step. The algorithm can be outlined as follows: 

1. At the time t = 0, the pressure p, water saturation Sw, and moles 
m} , . . . , of the hydrocarbon components are known from the given 
initial conditions for the model. 

2. Apply the phase package to determine the phase distributions ml, . . . , 
and TOg , . . . , m^ based on m^, , m^ . 

3. Compute the phase viscosities po and Pg (recall that is assumed 
to be constant) from the empirical correlation fl()| and compute the 
densities Po and Pg from the equation of state (e.g. Peng-Robinson) . 

4. Calculate the ratios Vo and Vg of phase volume to pore volume Vo = 
TTiol Po and Vg = mg! Pg, where mo = ml -|- . . . -I- m^ and mg = ml + 
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. . . +m^ . Then, calculate the saturations So and Sg by So = VofvT and 
Sg = Vg/vT m. 

5. Compute the relative permeabilities after the saturations are obtained. 
Steps 2-5 yield the coefficients for equations (3.2), (3.6) and (3.8). One 
then proceeds to the next time step. 

6. Apply an ELL AM ^ combined with the operator-splitting techniques 
to solve the nonlinear advection-diffusion PDE (3.6) for s^, and the 
linearized transport PDEs (3.8) for m} , . . . ,m^ . Here, in the coeffi- 
cients of Eqs. (3.6) and (3.8), the values of the corresponding variables 
are computed based on their values at the previous time step. 

7. Substitute the values of and m^, . . . at the current time step 
into equation (3.2), and apply an MFEM to solve Eq. (3.2) for the 
pressure p and the total velocity u at the current time step. 

8. If necessary, perform some iterations between Eqs. (3.2), (3.6), (3.8) 
and the nonlinear constraints at the current time step. 

9. Go back to Step 2 to update the coefficients at the current time step 
and repeat the above processes until one finally comes to time t = T. 

4 Numerical Simulations 

We use the method to simulate a ID, three-phase, multi-component flow 
with complex in situ phase behavior. The reservoir is 250 feet in length with 
the sectional area of 50 x lOO(feet^). The porosity of the reservoir is 20%. 
The initial reservoir pressure is 2000 psi and the reservoir temperature is 160 
F. The permeability of the reservoir is 2 Darcy, and the analytical relative 
permeabilities are given by the modified Corey’s model 



fc™=0.1s^, - (1 - So)^], krg = {I- SgY], (4.1) 



where the normalized phase saturations are defined as follows 




5 




The residual saturations of each phases are 



S'luj' — 0.35^ ^ oT — 0.25j ^ QF — 



(4.3) 
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Initially, the reservoir is filled with 20% water saturation and 80% undersa- 
turated oil saturation. Water phase is incompressible. The molar density of 
water phase is 3.467 lb — mole/ft^ and the viscosity is 0.5 cp (centipoise) . 
The oil phase is composed of 20% methane (light hydrocarbon component), 
20% butane (medium hydrocarbon component), and 60% decane (heavy hy- 
drocarbon component). The Peng- Robinson equation of state m is used 
in the flash calculations. The critical properties of these hydrocarbon com- 
ponents are listed in Table 1. The molar densities and viscosities of the 
hydrocarbon fluid are computed by the equation of state and Lohrenz rule 
mg, respectively. 





Peipsi) 


Tc (R°) 


Vc 


Wt 


CJ 


Methane 


673.1 


343.3 


1.59 


16.04 


0.014 


Buthane 


550.7 


765.4 


4.08 


58.12 


0.193 


Decane 


306.0 


1115 


9.66 


142.3 


0.489 



Table 1. Critical Properties 



We simulate an injection of 95% water and 5% hydrocarbon mixture (45% 
methane, 45% butane, and 10% decane) into the reservoir and the total 
injection rate is 1000 lb-mole per day. The water phase is mobilized by the 
injecting water, leading to a three-phase fluid flow process. 50 cells were 
used in space. The size of time steps is selected according to the time- 
step restrictions that are the maximum changes in (1) pressure, (2) overall 
composition of the hydrocarbon components, and (3) the phase saturations 
tolerated at each element. Since the initial guess for the fiash calculations is 
given following the characteristics by the ELLAM scheme (3.10), restriction 
(2) is relaxed. The left and right figures in Fig. 1 show the computed water 
and gas saturations vs at 50 and 150 days, respectively. Since water is the 
main stream of the injecting fluid, a water saturation front is formed and 
propagated as time evolves (at about 70 feet at 50 days and at about 170 
feet at 150 days). Because the reservoir pressure is increased due to the 
injection, the resident hydrocarbon fluid is vaporized to form a gas zone. For 
more numerical results, see 0. 

5 Summary 

The mathematical model for compositional flow involves a strongly coupled 
system of a large number of nonlinear PDFs and algebraic equations. Due 
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Figure 1 : The profiles of the simulation at 50 and 150 days. 



to the enormous size of the petroleum field applications, it is very expensive 
to solve the system in a fully-coupled and fully-implicit fashion in general. 
To derive a stable algorithm for solving the compositional system, we need 
to derive a proper form for the governing equations to alleviate the nonlinea- 
rities and couplings. Choices of primary variables, the proper forms of the 
equations to be solved, and implicitness and decoupling of the equations are 
of essential importance numerical simulations. 

With a proper choice of primary variables, we develop a sequential solu- 
tion procedure for a compositional model: (1) We use an MFEM to solve the 
oil phase pressure and the pseudo-total volumetric velocity. Instead of calcu- 
lating the phase velocities, we compute the barycentric velocities and obtain 
fairly accurate and smooth approximations. (2) We decouple and linearize 
the coupled nonlinear transport PDEs by using the barycentric velocities. (3) 
We use an ELLAM scheme to solve each decoupled and linearized transport 
PDEs, which yields accurate numerical solutions that are free of oscillations 
and numerical dispersion and permits the use of large time steps. Moreover, 
the utilization of the ELLAM scheme generates an accurate initial guess and 
considerably improves the efficiency for the flash calculations, which in turn 
alleviates the restriction on time step size bounded by the maximum relative 
changes in the overall composition of the hydrocarbon components tolerated 
at each element. These results show the strong potential of the method. 
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Abstract 

A finite volume dicretization has been used to solve compositional 
flow in porous media. Secondary migration in fractured rocks has been 
the main motivation for the work. Multipoint flux approximation has 
been implemented and adaptive local grid refinement, based on domain 
decomposition, is used at fractures and faults. The parallelization me- 
thod, which is described in this paper, strongly promotes code reuse 
and gives a very high level of parallelization despite low implemen- 
tation costs. The programming framework is also portable to other 
platforms or other applications. We have presented computer expe- 
riments to examine the parallel efficiency of the implemented parallel 
simulator with respect to scalability and speedup. 



KEYWORDS: porous media, multipoint flux approximation, domain decom- 
position, parallelization 

1 Introduction 

In this paper we are going to discuss some computational issues conected 
to flow in fractured porous media. The simulation of secondary migration 
jm cni has been the main motivation for the work O CHi. Secondary 
migration is the movement of hydrocarbons through a carrier bed/reservoir 
rock into a trap. In order to be able to simulate such a large and complex 
process as secondary oil migration processes, the model has to be simplified. 

Our model does not allow dynamical changes of the discontinuities. This 
means that the fractures and faults may not open and close frequently. The 
dynamical geometry variation, which exists in the model, is in the z-direction. 
Due to the difference in the overburden pressure and the pore pressure we 
may have a compaction of the rock or of the control volumes in the numerical 
discretization. To represent discontinuities, local grid refinement is used. 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 244-^^^ 2000. 

(c) Springer-Verlag Berlin Heidelberg 2000 
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For simplicity, only regular grid cells are used. The mathematical model 
describes compositional three-phase flow. Phase exchange between oil and 
gas and visa versa is allowed. In addition, the model includes thermodynamic 
equations, assuming equilibrium. The numerical model uses a block-centered 
flnite difference discretization technique. With the use of this technique all 
the unknowns are located in the center of the control volume. Most of the 
existing models use a simple two-point flux approximation (TPFA) for the 
flux calculation. This may be incorrect if there exists a discontinuity in the 
permeability or if the principal permeability directions are not along any of 
the grid directions. The two-point flux approximation method is also used in 
the model, but in addition we have the possibility to use a multi-point flux 
approximation (MPFA) method if necessary. The computer code is highly 
object oriented and it is written in C-|— 1-. Normally, the C-|— I- programming 
language is computationally slower than the FORTRAN 90 programming 
language. However, it is often much easier to parallelize an object oriented 
C-I--I- program than a FORTRAN 90 program. Another advantage when 
using C-l— I- is that it is easy to add/delete new/old objects and classes. 



2 Compositional Flow Model 

The mathematical model includes mass conservation, energy conservation, a 
generalization of the Darcy low, equations of state, and equilibrium calcula- 
tions. The components in the model are the hydrocarbons (ci, C 2 , ..., c„) and 
water. Totally the system includes ric = 1 + n components. 

Even though our general simulator (SOM) ^21E| includes this complex 
mathematical model, we have mostly used a simplified version of the model, 
named the symmetric Black Oil (SBO) model The phase calculation in 
this model is related to tables for bubble and dew points rather than fugacity 
calculations. In addition to allow some of the gas components to be in the 
oil phase, the SBO also allows some of the oil components to be in the gas 
phase. This means that the SBO model is a generalization of the black oil 
model |2|. The SBO model may also handle condensate systems. 

The primary variables in both models are the temperature T, the water 
pressure and all the molar masses N^, v = 1, 2, ..., ric. 

The mass conservation of component v of one chemical species which flows 
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through the porous media is given by: 

-[ mi, ■ dS = [ m„dV + f q^dV , v = 1,2, ...,ric ■ (2.1) 

Jcs dt Jqy Jqy 

Here m„, q„, CV, and CS denote the mass flux of component v, mass 
density of component n, source/sink density of component control volume, 
and interface of the control volume. Let C„„i denote the mass fraction of a 
component v in phase m, m = o, g, w. Then the mass flux and the mass 
density of the component v are respectively given by 

mj, 0i^gPg^g Ci/oPo'^O CiyuiPyjVyj , I 5 2, ..., Tic , 

mj, = (flpiCugPgSg ~\~ CuqPoSq CirwPw^w) i ^ ~ I 5 2 , ..., Tic • 

where Pm, Sm, and ip denote mass density of the phase m, saturation of the 
phase m, and porosity. The Darcy law for multiphase flow is given by 

Vm = -K - 7 fcVd) , m = g,o,w , 

k—g,o,w 

where iL, Pk, Pk, 7fc, and d denote absolute permeability tensor, genera- 

lized relative permeability for coupled multi-phase flow, viscosity of the fluid 
phase k, fluid pressure of phase k, specific gravity of fluid phase k, and depth 
(positive in the gravity direction). Equation (2.1) gives Tic equations (one 
for each component), while we have 3ric -I- 22 unknowns. The first 3nc -I- 21 
unknowns are C,,™, v = 1,2, ...,tIc, Pm, Sm, Pm, Pm smd kr^i,, m,k = g,o,w. 
The temperature T is the last unknown. To close the model we need to have 
2tIc + 22 more independent equations or relations. 

We assume that the three phases All the available pore space, which im- 
plies 

Sg + So + Syj = 1 . (2.2) 

We also know that the sum of the mass fraction, for each phase, has to be 
one 

Com = 1 , m = g,o,w. (2.3) 

V=1 

The density and viscosity are assumed to be functions of the temperature, 
the phase pressure and the mass fraction, while the generalized relative per- 
meabilities and the capillary pressures are functions of the temperature and 
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the saturations 

Pm — Pm Pm i 5 ■ • - j ? 'OT — g ^ W , 

Pm — Pm (^5 Pm : C±rnj C^aciri) 5 'OT — p ^ O^W , 

= kr^k(T,Sg,So,Sw) , m,k = g,o,w, (2.4) 

Pg Po PcgoC^: ^gi ^w') 5 
Po Pw — Pcwo{'-f^ ^g: Sn:^ • 

As the temperature may vary in time we must include the heat flow equation. 
To ensure conservation of energy, the model uses the integral expression 

{pu)dV- [ {kVT)-dS= - j hpu-dS+ [ qdV , 

(2.5) 

where 

pU = ^ ( 4^pSmnuiPm 4” UrPri^f 0p) , 

iri—g,o,w 

hpu = ^ ( hyjiPjYiVjYi . 

m—g^o,w 

where pm, Pr, Um, Ur, k, T, and h denote mass density of phase m, mass 
density of rock, internal energy of phase m, internal energy of rock, bulk heat 
conductivity, temperature, and enthalpy, respectively. Since the movements 
in the system are very slow, all the terms including velocity may be neglected. 
This is the reason why we have neglected the kinetic energy, the potential 
energy, the viscous dissipation in the fluid flow, and the deformation energy 
in the solid phase and in the rock. The only mechanical work rate included 
is the term related to normal stresses in fluid flow. 

Equations (2.2)-(2.5) give 22 new equations, we still need another 2uc 
equations to close the model. These equations are given by the assumption 
of thermal equilibrium for each time-step. It is convenient to use the Gibbs 
function in the thermodynamics of phase equilibrium. The Gibbs function is 
defined by 

G =U +pV -TS , 

where U, p, V, T, and S denote internal energy, pressure, volume, tempera- 
ture, and entrophy. 
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2.1 Pressure Equation 

The equations given above represent a closed set of equations to determine 
all the primary and secondary variables at a new time step. To calculate the 
primary variables we have used a sequential IMPEC (implicit pressure and 
explicit concentration) . The secondary variables are determined by using ex- 
perimentally based relationships and ’’flash calculations”, which is discussed 
in this work. A pressure equation can be derived in several ways. In this 
work, however, we have used the volume balance method (VBM) ^21 ^3 
to obtain the water pressure equation. We deflne the residual volume R as 

R = Vp- Y. 

k—g,o,w 

where Vp and Vk are the pore volume and the volume of phase k, respectively. 
They both depend on the primary variables, stating 

Vp = Vp{p^,W), Vk = Vk{T,p^,N^,...,NY , k = g,o,w, 

where 

= Nj^{T,p^,Ni,...,NnJ , 1^ = l,2,...,ric and k = g,o,w. 

The overburden pressure is given by IT = a + p , where a and p are the 
effective stress and the pore pressure, respectively. 

The residual volume is a function depending on time t. To satisfy the 
saturation condition (2.2) V t, the residual volume has to vanish V t. From 
this we obtain the volume balance 

R{t) = 0 V t . 

By assuming that we know the residual volume at some time t, denoted R(t), 
we are able to And the value for the residual volume at time t + At. This can 
be done by constructing the truncated Taylor expansion 

R{t + At) ^ R{t) + ^At = 0 . 

at 

If we use the chain rule for partial differentiation on we get 

OR dpw ^ dR dNi, R dR dW 

dp^lH^^dK^^~^t~dWl)t' 

y—1 

The partial derivatives of the residual volume with respect to the primary 
variables may be computed from equations of state and thermo-dynamical 
equilibrium conditions. Since we assume that the temperature is constant for 
this calculation, we do not have to include the term ^ ^ . 
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2.2 Boundary Conditions 

Normally, the equations governing the flow of fluids through porous media 
are second order partial differential equations. A Robin boundary condition 
is the most general type of boundary condition which is used for our problem 
and is given by 



Here n denotes the outward normal direction at some point on the boundary 
and a, f3, and 7 are constants which may be selected. The boundary conditons 
can be given explicitly or implicitly [T^ 1 1 iSj . 

3 Flux Discretization 

For the numerical approximation we use a control volume finite difference 
block-centered disretization. Each control volume CVi,i = where 

N is the total number of grid cells, represents a fixed cell in the grid. The total 
interface for CVi is denoted CSi. We denote the set of grid cells connected 
to CVi through a given flux molecule, including the grid itself, as Aii- The 
common subset of the interface between CVi and CVj, i ^ j, is denoted 
CS(^ijy This means that 



where rii is the total number of sub-interfaces on CSi and ji is the global 
neighbor CVj to CVi which share the local sub-interface 1. We then define 
■M.^(^i j-) as the set of cells which influence the flux molecule for the interface 
CS(^ijy The set depends (like the set Mi) on the flux molecule me- 

thod. If we use a two-point flux molecule approximation M)V^ = {*, j} 0, 
M)V) = {i,j,k,l,...} for other flux molecule methods. For a multi-point 
flux approximation the set -^}^i j) may contains 18 different cells for an inner 
sub-interface DIHI 3 Between Mi and we have the relationship 



By use of these definitions we may write a general flux approximation as 
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where Uq are named the transmissibilities. Note that these transmissibili- 
ties include both a solution-dependent part and a geometry-dependent part, 
which may be calculated independently. In this work we have consentrate on 
the calculation of the geometry-dependent part. For the calculation of the 
solution-dependent part we refer to inmsi. 

3.1 The Multi-point Flux Approximations (MPFA) 

The easiest way to calculate the geometry-dependent part of the transmis- 
sibilities is to use the two-point flux approximation (TPFA) for each sur- 
face. With use of this method only contains two cells and is given 

by -M-li = {hj}- The TPFA method has been known for a long time and 
frequently used in the reservoir simulation literature; see, for instance, |S|. 

A multi-point flux approximations method has been developed lately ^ 
E3IB|. As the name of this method indicates we use more than two points (two 
cell centers) to calculate the flux across the interface to CVi. In two 

dimensions Ml® n includes values from six different cells while the number 
of cells in increases to eighteen in three dimensions. 




Figure 1: Grid cells with interaction regions in two- and three-dimensions. 

When using this method the definition of the interaction region is essen- 
tial. FigureHshows a sketch of this region in both two and three dimensions. 
The MPFA method covers the domain of the differential equations by such 
non-overlapping interaction regions. Instead of calculating the transmissibi- 
lities for directly, this method calculates the transmissibilities across 

each sub-surface in the interaction region. To calculate the transmissibili- 
ties for the method adds the contributions from the two interaction 

regions involved. 

For simplicity, we have chosen to present the flux discretization techniques 
in two-dimensions. The implementation of these techniques in the SOM 
simulator are three-dimensional. 
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To calculate the flux across the interface segment AE (see Figure^, 
we may use the approximation 

r ^ ^ 

fn% = - {XKVu)-dS:^ 'V' tAqUq = ^tAiUi . 

Since the flux must be zero when Ui is a constant vector, it follows that 
Si=i = 0. There exist several MPFA methods which determine tAi 
D 13 0 dllHI The main difference between these methods is where they 
require continuity in flux and potential. The 0-method requiers continuity 
in both flux and potential at the points A,B,C and D, given in Figure^ 
In 1 1 2L II iS) they propose an improvement of this method, which ensures full 
continuity across the interfaces AE, BE, CE, and DE. 



3.2 Non-matching Grid Lines and the OSCI-Method 

As far as we know, the MPFA method has not yet been given for non- 
matching grids in three dimensions. In |2| some two dimensional cases are 
shown. In this section we use slave cells to handle this problem in both 
two and three dimensions. Figure [3 shows how we implement slave cells 




Figure 2: Figures show the grid with only real cells, with both real and slave cells 
and with the non-overlapping interaction regions between slave cells. 

and create non-overlapping interaction regions between these cells. We refer 
to this MPFA method as the OSCI-method (0-method with slave cells and 
interpolation) . 
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To create the new slave grid, all the lines from one refinement which touch 
another refinement or a coarse cell are extended to the next crossing line (see 
Figure n. This means that one real coarse cell is divided in the number 
of fine cells touching the coarse interface. If lines from different refinements 
match, slave cells are identical to real cells. Thus, if the real cells create 
a non-matching grid, the OSCI-method creates non-overlapping interaction 
regions between slave cells. Then this method calculates the transmissibilities 
on this non-overlapping slave interaction regions in the same way as the O- 
method does across interfaces for matching grids. Note that this technique 
does not require regular grid density, but otherwise it is a general technique 
that fits all kinds of adaptive LGR. 

After we have calculated the slave transmissibilities, an interpolation is 
needed. Either we may interpolate these contributions to the real cells or 
we may interpolate the quantities in the real cells out to the slave cells. We 
have chosen the latter alternative. So far only a linear interpolation has been 
carried out, but it should not be a problem to implement higher order and 
more accurate interpolation techniques. Since this problem is beyond the 
scope of this work, it is left for future work. 

For a uniform grid in two dimensions with a homogeneous permeability 
tensor K, it has been shown that the OSCI-method creates a second order 
method. This is identical to the 0-method across interfaces for matching 
grids. Another advantage with this technique is that it is quite simple to 
implement. The drawback with this method is that it creates non-overlapping 
interaction regions between slave cells and not between real cells. Figure 0 
shows how we cover a domain by non-overlapping real interaction regions and 
non-overlapping slave interaction regions. The 0-method and OSCI-method 




Figure 3: The domain covered by non-overlapping real interaction regions (grey 

regions) and non-overlapping slave interaction regions (red regions). 
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have been compared dHIHl and it turns out that the OSCI method works 
well for nonmatching grids. 

3.3 Adaptive Use of the TPFA and the MPFA Methods 

Based on the differences which exist between a TPFA method and a MPFA 
method, it is possible to decide which flux approximation method to use in 
different cases. The MPFA method should be used for the cases |7| 

• Whenever the medium is anisotropic and non-aligned with the local 
frame of reference. 

• Whenever non-orthogonal and/or unstructured grids are employed. 

• Whenever fine-scale cross-flow upscaling is performed particularly for 
cross-bedding. 

The OSCI-method may also be used to match different solutions from dif- 
ferent domains. Since the convergence of a domain decomposition method 
depends strongly on the accuracy of the exchange of information between 
different domains, one should expect that a MPFA method improves the 
convergence. Thus an improved Mortar method 0 may be developed. 

It is only due to the large computational cost that we should avoid to 
use am MPFA method in the total domain. Normally, when using a LGR 
technique, an MPFA method should be used in the refined regions where the 
solution includes large gradients, while a TPFA method should be used in the 
coarse domain where no large changes exist. The SOM has the possibility to 
combine the two flux approximation methods like this. In an input file the 
user has to specify the flux approximation method for each sub-regions. The 
only restriction related to this specification is that TPFA method has to be 
used between coarse grid cells in a composite domain. 

Trying to reduce the large computation time caused by MPFA, a splitting 
in the matrix level is suggested in j7j . This paper shows how we may decom- 
pose a general full tensor flux into a diagonal tensor flux together with cross 
terms. Then, time-split semi-implicit, stable, full tensor flux approximati- 
ons are introduced within a general finite volume formalism. The standard 
diagonal tensor Jacobi matrix structure may now be retained while ensu- 
ring spatial consistency of the discretization. With the use of this M-matrix 
flux splitting method for a general full tensor discretization operator, the 
computation time is reduced by over 50% for quadrilateral grids. 
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4 Local Grid Refinements (LGR) 

One of the main objectives in this work has been to implement a LGR in the 
SOM framework. In this section we try to describe how we have solved this 
problem. Even though the main interest is to use the LGR for fractured and 
faulted problems, we have selected a general LGR which also may be used in 
conjunction with resolving wells or larger gradients. 

In this work we have used a fixed coarse grid and defined refined sub- 
domains from a pre-selected set of coarse grid blocks. In many situations, 
some care must be taken when constructing the coarse grid. If a large gradient 
is located near a coarse grid boundary, the numerical solution is affected. A 
dynamic adaptive coarse grid can be favorable in cases where we have large 
moving gradients (i.e., a moving front). In these cases, it is important to use 
a refined grid in a region containing large gradients. This has been used and 
studied in p). A fixed coarse grid is natural to use in cases where we try to 
resolve the dynamics at larger fractures and faults in addition to wells. If we 
know where these objects are located, this information must be used when 
constructing the coarse grid. As far as possible, regions with large gradients 
must be located inside each coarse block and not on or near the interface 
between them. 

Based on the discussion above, we could also have used an adaptive LGR 
strategy. First, we create the fine grid, including large fractures and faults. 
Then the coarse grid could be constructed adaptively. The coarse grid must 
depend on the solution and the numerical behavior. The adjustment of the 
coarse grid could depend on the information from a first iteration. This 
option must be included in further development of the simulator. 

There are two main difficulties related to LGR: 

• to create the composite grid, 

• to store all the data in an efficient and suitable manner. 

We have chosen to use the existing coarse grid generator as a starting 
point. After we have generated a coarse grid, we determine all the coarse 
cells which have to be refined. Then we use a grid generator for each of 
prescribed refined subdomains. The gridding can be different in each sub- 
domain and the sub-domains can be arbitrary placed in the coarse grid. This 
means that we are able to create arbitrary refined domains; see Figure 2] 
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There is no global numbering of the geometry variables in this implemen- 
tation. The grid generator also produces input files for the lithology mapping 
and boundary values mapping for each refined sub-domain. This means that 
we can prescribe an independent set of lithologies and boundary values for 
each sub-domain and the coarse domain. After we have created the geometry 
input files for each refinement and the coarse region, SOM has to be modified 
for achieving the use of these files. Since SOM is written in C-I--I-, this is done 
in an object-oriented framework. By using local numbering on each refined 
sub-domain, we can preserve the original regular data-structure for each refi- 
ned patch. This means that we can use existing functions for discretization, 
linearization and fast linear solver for each refined sub-domain, even if the 
composite grid is not regular. This strategy may keep the cost related to the 
implementation of a LGR in the SOM framework at a minimum. 

It is also possible to use dynamical allocation of new elements in the 
different class arrays in C-|— 1-. This means that it is straightforward to add 
or delete refined sub-domains adaptively during the computations. Further 
details about the implementation is given in EHIH]. 

5 A Composite Grid Solver 

Local adaption introduces several design challenges including: grid struc- 
tures, error control, efficient linear solvers and parallelism. The conver- 
gence rate and the complexity of iterative methods usually deteriorate with a 
decreasing size of the smallest mesh size. These small scales may even hinder 
direct methods since they determine the condition-number of the discretiza- 
tion matrix. 

If we can use existing fast solution methods developed on regular grid, on 
non-regular composite grids, we have made a large step towards developing 
efficient parallel algorithms for solving the same composite problem. 




Figure 4: This Figure show how we divide the coarse grid into many refined 

no-regular non-matching sub-domains. 



We assume that equations (2.5) and (2.6) have been discretized on a 
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composite grid as shown in Figure 0 Then we have the linear system to 
solve 

Ax = y, X , y G TZ^ . (5-1) 

Here A is the composite grid Jacobi matrix, x and y are the composite grid 
solution vector and the right hand side vector, respectively. N is the total 
number of grid cells in the composite grid. 

A global numbering of the nodes for this composite problem would dest- 
roy the nice regular banded structure in the matrix. This section describes 
an iteration technique that allows to decouple the refined patches and the 
underlying coarse grid. This means that the coupling is through the internal 
boundary between the coarse and the refined regions. We can then use fast 
regular solvers for each refined sub-domain. To attain this, we divide the 
composite grid H into different disjoint sets; He consist of grid points in the 
non-refined region and — Ur=i union of all grid points in the 

refined sub-domains. Here p is the number of refined sub-domains. After this 
partitioning equation (5.1) can be written 



■^CC 


A^ 1 




Xc 




Vc 


An 

L ^/c 


An 

J 




. . 




. yj . 



Our solution technique is based upon a two-step method, with a splitting 
based on a Galerkin technique. This is a two level domain-decomposition 
method, where we have one regular coarse grid and many regular refined sub- 
grids. This means that we split the linear equation (5.1) into different regular 
sub-problems, corresponding to each sub-domain. A coarse grid operator is 
used to handle the communication between the sub-domains. The Galerkin 
technique was first introduced in m This technique has later been studied 
in [TT7irninnT?j 

5.1 A Galerkin Composite Grid Solver 

The algebraic algorithm, proposed in m for solving the linear equation (5.1), 
can be formulated as follows. Let H denote the grid without any refinement, 
i.e. the regular coarse grid Hg in Figure 2| Further, let N be the number 
of grid blocks in H and let = {fine cells within coarse cell number f}, 
i = 1,2,...,A^. Defining the local refined sub-domains in this way, means 
that we only have one coarse grid block under each refined patch. Let N 
be the total number of cells in the composite grid, shown in Figure E] We 
define a basis vector g TZ^ , for the coarse cell i. It consists of only zero 
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element, except for unit element at all i € Mj, i = 1,2, ..., N. Let be the 
(s) iterate of the composite solution vector x. This gives us the function 



N 

z = = 

i=l 

'■(s) ' 

Here , is the i th component of d on H. Note that this function would 
describe an updated composite solution vector z, from the error on the coarse 
grid Using z as a test-function in a Galerkin formulation, then for each 
coarse cell j the equation should be satisfied: 






z G 77. 



N 



(5.2) 



= , i = 1,2, ...,7V, 



here (., .) is the inner product. This would give us a coarse linear system to 
solve for d 

, ^(«) ^(s) ^ - 

Ad = y, d ,y€TZ^, (5.3) 



where 



Hi= Y. Y Hd, 



N 

h = Y 

ieMf, r=l 



Here ki and li indicate the local cell numbers i and j in the coarse cells k and 
j, respectively. Note that if k is not refined, consists of only one coarse 

cell. After solving equation (5.3) for d , we use the definition for z from 
equation (5.2) to get the new estimate for the composite solution vector 

= £ 



This means that we choose When we solve for Xf^, on 

each of the refined sub-domains g = 1, 2, ...,p, we use the last updated 

-.(s+1) -*(s) 

solutions Xc and Xj^ as boundary conditions. This would be a kind of a 
two level block Jacobi formulation, and all the sub-domains can be solved 
in parallel. We can also use a coloring of the sub-domains, or use a typical 
two level Gauss-Seidel formulation. By using a block Jacobi formulation, 
each sub-domain q= 1,2, ...,p, can be given as 



A 



iaYu =yf^’ 



G 77 



N„ 



vfc ^yf,~Y 



?(s+i) 



where 
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Note that these problem can be solved in parallel, by using the latest updated 
boundary conditions. Then the updated composite solution vector is given 

by 



yis+l) 



kLc 



The iteration goes back to the definition in equation (5.2) and it proceeds 
until the composite solution vector satisfy 



7(s+i) _ yU) I 



?(s)| 



< e. 



Here e is a prescribed tolerance. 



6 Parallel Implementation 

The Galerkin algorithm described earlier can be formulated as an iterative 
process, where in each iteration we solve updated boundary value problem for 
each sub-domain. The work on each sub-domain, consists mainly of building 
the corresponding Jacobian and the right hand side. Then solving the cor- 
responding linear system restricted to one sub-domain, using values from its 
neighboring sub-domains as boundary conditions. The sub-problems can be 
solved in parallel because neighboring sub-problems are only coupled through 
previously computed values in the neighboring sub-domain. The basic buil- 
ding block for the Galerkin domain-decomposition method is the sub-domain 
solver. The sub-domains can be carried out completely independently, allo- 
wing the code to run on different processors of a parallel computer. 

Traditionally, domain-decomposed parallelization of PDE is done at the 
level of local matrix/vector operations and linear solvers. However, we are 
interested in a parallelization strategy at the level of sub-domain simulators. 
In |SIE]) they have proposed a Simulator Parallel Model (SPM) for paralleli- 
zing existing sequential PDE simulators, which is a Single Program Multiple 
Data (SPMD) model. Basically, the SPM based on domain-decomposition for 
developing parallel PDE software is: Assign one processor with one or several 
sequential simulators, each responsible for one sub-domain. The coordination 
of the computation among the processors is left to a global administrator, im- 
plemented at a high abstraction level, close to the mathematical formulation 
of the Galerkin method. 

The SPM strongly promotes code reuse, because most of the global admi- 
nistration and the related communications between processors can be extrac- 
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ted from specific applications. This means that we minimize the development 
costs and we have efficient control of performance parameters such as load 
balance, communication topology and domain-decomposition. 

The generic framework object-oriented framework for implementing a par- 
allel version of SOM, based on the simulator-parallel programming model, 
consists of three main parts: The sequential sub-domain simulator Sub- 
domain_Simulator, a communication part Communicator and a global 
administrator SOMJVianager. Figure 0 shows the simplified framework. 




Figure 5: The figure shows the object-oriented framework for the simulator parallel 
model. The three main parts are: the sub-domain simulators with local data, the 
global administrator and a communicator class. 

The sub-domain simulator is the most important building block in the 
model. It consists mainly of the code from the sequential simulator, but also 
of some ad-on functions to handle the parallel computing, synchronization 
and to be able to use communicated boundary values. We assign one sub- 
domain simulator, controlling one refined sub-domain, to each processor. In 
addition we let one processor controlling the coarse grid solver. 

During the parallel simulation, the concrete communication between pro- 
cessors occurs in form of exchanging messages and it is all handled by objects 
of Communicator. On each processor there is one such object connected 
to the local sub-domain simulator. This class uses member functions, given 
by each of the sub-domain simulators, to retrieve and modify its local data. 
The only concern of this class is sending and receiving data between each 
processor, and no actual computing is done here, just assembling and storing 
data. To solve these two problems, this class have a close connection to the 
OOMPI library. We have implemented a manager class SOM_Manager, to 



260 



Reme, 0ye, Espedal, and Fladmark 



administrate each of the sub-domain solvers. The class is controlled from the 
main program, and it has the global view if the concurrent solution process. 



7 Some Computer Experiments 

In this section we report parallel simulation results of some numerical expe- 
riments. The purpose is as follows: Examine the parallel efficiency of the 
implemented parallel simulator, due to scalability and speedup. We have 
carried out the experiments on a SGI Origin 2000, with RIOOOO processors, 
to explore the behavior of the method. 

7.1 Computer Experiment 1 

In the first computer experiment, we are mainly interested in exploring the 
parallel implementation due to scalability and parallel efficiency. The scaling 
factors are dependent of two parts. First the independent variables, which are 
the problem size and the number of processors. Then we have the dependent 
variables, which are the variables inside the respectively parallel application. 
We concentrate on the first group. By varying the number of grid blocks and 
the number of sub-domains used, we want to explore the scaling properties 
and parallel performance compared to the sequential program. 

The number of grid blocks, or the problem size, could have a large in- 
fluence on the parallel efficiency and the scalability of the parallel program. 
Larger problem size means larger memory requirements, which again leads 
to how the actual computer handles a larger memory in the cache. A par- 
allel program could have a great performance on small problems, but would 
be very slow on very large problems. Communication time depends on the 
number of grid blocks on each sub-domain and on the time for constructing 
and sending boundary data to neighboring sub-domain processor. 

Also, we want to vary, is the number of processors used. When increasing 
the number of processors, we expect that the used wall clock time would 
decrease. Ideally, it would decrease in a linear way. 

The partition of the domain is as follows. We divide the domain in 
equally sized sub-domains M and connect one processor to one sub-domain. 
The decomposition of the grid is done in 3D. In addition, we use one 
processor for the coarse grid. Thus, the total number of processors used 
becomes P = M + 1. With this partition of the domain we would have a 
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#p 


Speedup 


# N 


NX X NY X NZ 


CPU (s) 


9 


9.7 


2.88-10'^ 


40x30x30 


214 


9 


9.5 


5.76-10'" 


80x30x30 


534 


9 


10.6 


11.52-10" 


80x60x30 


1085 


17 


16.5 


2.88-10" 


40x30x30 


125 


17 


18.7 


5.76-10" 


80x30x30 


262 


17 


17.8 


11.52-10" 


80x60x30 


598 


33 


32.6 


2.88-10" 


60x60x10 


60 


33 


33.4 


5.76-10" 


60x60x20 


132 


33 


33.7 


11.52-10" 


60x60x40 


306 


49 


48.8 


11.52-10" 


80x60x30 


220 


49 


48.3 


23.04-10" 


80x60x60 


433 



Table 1: The used CPU time (in seconds) and the parallel performance for different 
number of processors P and different problem-sizes N. NX x NY x NZ are the 
number of grid blocks in each direction. In each grid cell we have 8 unknowns N: 
6 hydro-carbon components, the water pressure and the temperature. P = M + 1, 
where M is the number of sub-domains. The speedup numbers are calculated by 
comparing with the sequential CPU time. 



perfectly load balanced system. Since we use uniform lithology in fl, the 
computational complexities would be the same for all sub-domains. 

From Tabled we can see that the parallel efficiency of the parallel version 
of the simulator is very good. When increasing the number of processors by 
a factor of two, we then approximately decrease the CPU time by a factor 
of two. This means that the parallel code is very efficient due to the com- 
puting/communication relation. The scalability is given by the expression 
T(P, N) = T{kP, kN). Here T is the CPU time, P is the number of proces- 
sors, N is the total number of unknowns and k is the scaling-factor. From 
Table □ we see that this is almost given. The load on the Origin 2000 com- 
puter varies from time to time, which can explain some of the jumps in the 
given CPU time in Tabled 

Table Q has also a number for speedup, which is the result by comparing 
the parallel version against the sequential version of the code. Tabled shows 
that we have a super-linear speedup in this experiment. It is caused by the 
fact thata CPU working in parallel may have a faster access to the cache 
memory than a single processor has. 
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7.2 Computer Experiment 2 

The previous computer experiment was relatively simple. The rectangular 
domain-decompositions give a rectangular communication topology, which 
could improve the parallel performance. Many applications require a non- 
rectangular domain-decomposition/communication topology, which could af- 
fect the communication time. 




Figure 6: These figures show the oil which is migrating through a faulted carrier 
bed. The oil tends to follow the permeable layers connected by the very high per- 
meability crushed zones (red color). The coarse grid blocks (green color), represents 
almost non-permeable lithology. 

In the next problem we use a non-regular geometry with locally refined 
patches. In this implementation, we only use a very simple compositional 
model, the SBO na. This mainly interpolates from some dew-point tables 
and bubble-point tables. This simplification implies that, doing the flash 
calculation would not take much part of the time. 

We want to simulate the migration of oil through a faulted area. Parallel 
to the faults there are crushed zones with high permeability and high porosity. 
These high permeability zones may create some computational complexities. 
The computing domain and the domain decomposition are shown in Figure 0 
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The coarse grid blocks are representing a very low permeable litholology. In 
Figure El the coarse grid blocks are removed and we can clearly detect the 
faulted area. The permeability varies from almost impermeable, to 500 mD 
in the crushed fault zone. The layered sediments have varying permeability, 
but all below 500 mD. The aspect ratio, which is the relation between the 
horizontal and vertical permeability, is 10 in this experiment. 

The extent of the domain is 1000 m x 300 m x 140 m and the carrier bed 
are lifted 200 m above the horizon. As boundary condition we use a relatively 
large water flux = 1.0 • 10“'^ mol/m^s, from left to right. This is injected 
only in the permeable sediments. An oil flux Qo = 1.0 • 10“® mol/m^s, is also 
injected in the lowest portion on the left side. Only water can go out of the 
sediments on the right-hand side. The simulation period is 300000 days, or 
about 822 years. It should be noticed that all the lithologies data and the 
boundary conditions are realistic. 

Each refined subdomain follow a faulted block, containing 200 grid blocks, 
which means that we have about 2500 grid blocks in total. In Figure 0 we 
have given two snapshots of the oil saturation during the simulations after 
50000 days and after 250000 days. Here we can clearly see the oil migrating 
through the higher permeability zones. These are connected via the crushed 
faulted zone with a very high permeability. We can also see that some of the 
oil is trapped, i.e. on the top of two of the faults in Figure El Table El shows 
the CPU measurements for different parts of the IMPEC solution strategy 
in the code. The CPU measurements are after 10 days of simulation. The 
system would get stabilized during the simulation, so a longer time period 
would not be relevant for these results. 

We notice is that we have a great loss of parallel performance in the 
“Solve Pressure” function. This loss is mainly due to the bad speedup of the 
Galerkin solver, only 9.8. This experiment demonstrates that the geometrical 
complexities would strongly affect the time solving the pressure equation. In 
this experiment, we have located the high conductivity faults on the interface 
between two coarse grid blocks. We know that this may be a very bad choice. 
Instead, the faults should be located inside the coarse grid blocks, which 
probably would given a better numerical results. The overall speedup is still 
quite good 12.1, despite the poor performance in the pressure solver. 
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13 CPU 


Sequential I 


Function 


CPU (s) 


% t. loop 


speedup 


CPU (s) 


% t. loop 


Update rock data 


0.2 


0.26 


13.0 


2.6 


0.02 


Calc coupl. coeffi. 


0.5 


0.67 


13.0 


6.5 


0.7 


Solve temperature 


5.9 


7.9 


11.3 


66.7 


7.5 


Solve Pressure 


23.8 


32.0 


10.6 


255.0 


28.4 


*Eq. setup 


5.0 


6.7 


14.6 


73.3 


8.1 


*Eq. solve 


18.3 


24.0 


9.8 


178.4 


19.8 


Calc molar masses 


40.0 


53.3 


13.1 


523.4 


58.3 


Do flash calc. 


0.4 


0.5 


11.8 


4.7 


0.48 


Update physical var. 


0.8 


1.0 


11.3 


9.0 


0.88 


Total time loop 


74.4 


100 


12.1 


897.2 


100 


Total simulation 


74.8 


* 


12.1 


899.0 


* 



Table 2: This table gives the CPU measurements from the IMPEC solution 

strategy. The CPU time is a result of 10 days of simulation. 



8 Conclusions 

In this work we have used a fixed adaptive LGR technique. In simulation of 
reservoirs with fixed wells, faults and large fractures, certain fixed LGR tech- 
niques have proven to be very effective. Our fixed adaptive LGR technique 
creates non-matching interfaces, as nearly all LGR techniques do. 

The most important issue when working with fluid flow is to conserve the 
mass. To ensure mass conservation, the flux calculation across any of the 
interfaces (both matching and non-matching) have to be treated with great 
care. In most application, a two point flux approximation has been used. 
In this paper a muli point flux approximation (MPFA) is also implemented. 
As far as we now, a MPFA method has not yet been applied to cases with 
non-matching interfaces in three dimensions. In this work we propose to 
add slave cells close to the non-matching interfaces. Since these slave cells 
create a locally matching grid an existing MPFA method may be used. The 
Simulator Parallel Mode (SPM) which is described in this paper, strongly 
promotes code reuse and gives a very high level of parallelization despite low 
implementation costs. This programming framework is also portable to other 
platforms or other applications. With this model we have good control with 
performance parameters as load balance, synchronization and communication 
topology. 

We have presented two computer experiments to examine the parallel 
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efficiency of the implemented parallel simulator with respect to scalability 
and speedup. In the first experiment we examine the scalability for a large 
homogeneous one-phase problem. In the next experiment, we go a step fur- 
ther and study multi-phase heterogeneous problems. This problem requires 
a much larger portion of communication and synchronization, than in the 
first experiment. However, the experiments still show nice results for par- 
allel efficiency and speedup. In many industrial groups, clusters of PCs or 
workstations are used. This means that for effective use of the computers, 
a parallel implementation based on explicit message passing inside the net- 
work should be considered. Our parallel implementation framework puts 
no restriction for running the code on such networks/clusters with PCs or 
workstations. This option should be implemented in the future. Asynchro- 
nous MPI implementation should be considered, due to the heterogeneous 
computers in a typical network. 
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Relationships among Some Conservative 
Discretization Methods 

Thomas F. Russell 



Abstract 

Relationships among various mass-conservative discretization tech- 
niques for equations of the type —V • KVp = q on distorted logi- 
cally rectangular meshes are discussed. The case of heterogeneous, 
anisotropic K is important for applications to subsurface porous me- 
dia, in particular the groundwater flow equation and the pressure 
equation of petroleum reservoir simulation. Some methods are based 
on K itself, others on K~^. Within one of these groups, mass lumping 
and quadrature can be keys to understanding connections between me- 
thods; incomplete inversion of the mass matrix is useful in relating one 
group to the other. 



KEYWORDS: conservation, distorted grid, finite volume, mixed method 

1 Introduction 

The purpose of this paper is to introduce some concepts that could be use- 
ful in understanding relationships among various types of conservative di- 
scretization methods. We concern ourselves particularly with finite volume, 
flux-based finite difference, and mixed finite element methods designed to cal- 
culate accurate fluxes on distorted meshes for problems with heterogeneous, 
anisotropic conductivity. Such procedures can be important in a variety of 
applications, particularly subsurface flows in porous media. 

This study is merely introductory, and is far from exhaustive or definitive. 
The methods discussed here are sufficiently varied and complex that a com- 
plete analysis of their relationships would constitute a much longer paper. It 
is not our goal here to compare the merits of these approaches; other forums 
will be more appropriate for that. We also do not address the nontrivial is- 
sues of solving the discrete equations arising from these methods. Rather, we 
seek some simple tools that can assist efforts to comprehend a “big picture” 
of discretizations that can fairly be described as confusing. 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 267-^^^ 2000. 
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For simplicity of exposition, the spatial domain will be limited to 2-D, with 
3-D mentioned only in passing. This is a significant simplification, because, 
for example, the edges of a quadrilateral (bilinear image of a square) are 
straight, but the faces of a hexahedron (trilinear image of a cube) need not be 
planar, and while the Jacobian of a bilinear mapping is linear, in general for a 
trilinear mapping it is nonlinear. The 2-D methods discussed here extend in 
one way or another to 3-D, and are in various stages of development for 3-D. 
However, for our introductory purposes here, we find it best to emphasize 
2-D. For similar reasons, we consider only a linear pressure equation, and our 
focus is on logically rectangular meshes, though more complicated PDFs and 
grids of more general connectivity could be addressed. 

Section 2 describes the lowest-order Raviart-Thomas (RTq) ^2^3 mi- 
xed finite element (MFE) method, as a starting point to which other methods 
can be related. The analogous control-volume mixed (CVMFE) method ^ 
is introduced in Section 3, along with some modifications by Garanzha and 
Konshin [ 7 ] and some observations in [ 7 | that relate the CVMFE to the 
support-operators (SO) method of Shashkov and co-workers [Bl E! ■ These 
methods are related in a different way in Section 4 to the multi-point flux ap- 
proximation (MPFA) methods of Aavatsmark and co-workers min Section 
4 also briefly discusses the expanded mixed (EM) method of Arbogast et al. 
019 and the mixed finite volume (MFV) method of Thomas and Trujillo 
cni, and Section 5 summarizes the paper. 

2 RTq Mixed Method 

A MFE represents a partial differential equation as a system of lower-order 
equations, solving these for multiple variables of physical interest. In the 
context of porous media, we assume incompressible flow, neglecting gravita- 
tional effects, so that the pressure equation (with no-flow boundary condition 
for simplicity) takes the form 



V-(KVp) = q, 


X e 12, 


(2.1) 


— KVp-n = 0, 


X G 912, 


(2.2) 



where K (scalar or anisotropic tensor) is the mobility or hydraulic conduc- 
tivity, p the pressure, q a source/sink (e.g., well) term, and 12 is the reservoir 
or aquifer with boundary 912. Let v be the velocity vector, and express (ED 
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as a system representing Darcy’s law and conservation of mass, respectively: 

V = -KVp, (2.3) 

V • V = q. (2.4) 



A primary goal is to obtain a more accurate v, especially when K is hete- 
rogeneous, by solving the system for v and p, instead of solving 

(ED for p and applying (lO) to obtain v. Piecewise-constant test functions 
in ii'Z.ri) below will also yield local mass conservation. 

As in 1st ^ G H{div, fl) : w • n = 0 on i9n}, P = L^(n), write 
{\z:zk\ as 

K-\ + Vp = Q, (2.5) 

and arrive at the weak form of (I2.;tll - (r2.4ll . which is to find v G V and p G P 
such that 

/ K~^v-wdx— / V-wpdx = 0, w G V, (2.6) 

Jq Jq 

j V vzdyi = / qzdx, z G P. (2.7) 

JQ Jn 

The rectangular RTp elements define discrete subspaces V/j and Ph with 
respect to a cartesian grid. Ph consists of the piecewise-constant functions. 
'Vh can best be viewed by associating a degree of freedom with the flux 
(constant normal component times edge length |if|) on each inter-block edge 
E. A typical basis function has flux 1 on one edge {D in Fig. 1) and 0 on 
all others, with the flux varying linearly in the direction of the velocity. For 
arbitrary quadrilateral grids, appropriate pressure and velocity spaces are 
defined via the Piola mapping [m[i2j (see Section 3.2). 




Figure 1. RTq velocity basis function on rectangles. 
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3 CVMFE and K~^ Methods 



In addition to the local mass conservation provided by (12.711 . the CVMFE 
seeks a local discrete Darcy law, which an engineer could view as applying to 
a cell-sized “tank” with pressures imposed at the ends. In the MFE context, 
this is analogous to the formulation of control- volume finite element (CVFE) 
methods from Galerkin FE, hence the name. We describe the formulation 
first on rectangles, then on distorted quadrilaterals. 

3.1 Rectangular Grid 

Starting from the system (E^ - (E3t . consider the pressure cells Qij and 
the control volumes Qi+ij 2 ,j and Qij+ 1/2 associated with the edge fluxes 
{fx)i+i/ 2 ,j and {fy)ij+i/ 2 , as depicted in Fig. 2. For this case, we take 
K = fc to be scalar. The unknowns are associated with cells and edges as 
in the MFE, and the trial functions for v and p are again the RTq spaces. 
In the MFE, the test functions w and z are from the same spaces; in the 
CVMFE, (L!.4II is still treated in this way, but the x- and ^-components of 
the vector Darcy law (E3| are instead integrated over “tanks” Qi+i/ 2 ,j and 
Qi j+ 1 / 2 , respectively. This is equivalent to taking the scalar product of (12.51 
with w = (1,0) on Qi+i/ 2 ,j and (0,0) elsewhere, and with w = (0,1) on 
Qi,j+i /2 and (0,0) elsewhere, respectively, and integrating. The resulting 
partial derivatives of p can be integrated out, leaving 



f 

J X: 



i+1 rVj+1/2 



k Vx{x,y)dydx 



CVj+i/2 



(p{xi+i,y) - p{xi, y)) dy = 0, 






'Vj-1/2 



(3.1) 

and similarly for the y-component. Using the RTq trial functions, the inte- 
grals such as ini) are expressed in terms of the unknowns p, fx, and fy. We 
then obtain the discrete Darcy equations on the “tanks”: in the cc-direction 
on Qi-\-lj2,ji 



0-i+l/2,j-A{fx)i-l/2,j + <^i+l/2,j-D{fx)i+l/2,j 

+0,i+l/2,j-,G{fx)i+3/2,j+Pi+l,j~Pi,j = 0; 

where A, D, G refer to edges as in Fig. 1, and 

1 

= 8 IQ- I ’ 



0‘i+ll2,j-D — 






8 \Q%,j 



8 |Qi+ij-| 



(3.2) 

(3.3) 

(3.4) 
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Oi+l/2,i;G 



1 

8 IQj+ijI 






(3.5) 



where Axi = Xi+ 1/2 - AxiJ^i = a;i+ 3/2 - Xij^xji, with an analogous 

equation in the y-direction on Qij+i/ 2 - 




Figure 2. Cells, unknowns, and control volumes for rectangular grid. 

Integration of (jZ3|) over the cell Qij (equivalent to multiplying by a test 
function z = 1 on and 0 elsewhere), together with the Gauss divergence 
theorem, yields the discrete mass conservation: 

{fx)i-lj2,j — ifx)i+l/2,j + {fy)ij-lj2 — {fy)i,j+l/2 = (3-6) 

Equation (1^ . its y-analogue, and (ED give rise to a symmetric system of 
linear equations that is solved for the pressures at block centers and the fluxes 
across edges. 

3.2 Distorted Quadrilateral Grid 

If <5 is a convex quadrilateral, then there is a unique bilinear mapping of 
a reference square Q = [0, 1]^ onto Q that sets up coordinates on Q. The 
pressure in Q is associated with the “center” of Q, meaning the image of the 
center of Q. Note that this is not generally the centroid of Q. 

The extension of the CVMFE method to general quadrilaterals requires 
that continuity of flux be maintained, so that the normal component of a 



272 



Russell 



velocity function must be constant on each edge. Then we can associate 
degrees of freedom with fluxes on edges, as in the rectangular case. In Fig. 3 
we show two adjacent quadrilaterals with coordinates determined by their 
local bilinear mappings. The velocity vector function Vi_|_i/ 2 j that has flux 1 
(hence normal component l/|£'i+i/ 2 j |) on the common edge £’i+i/ 2 ,j (labeled 
in Fig. 4) and 0 on the other edges is pictured. It is oriented along x- 
coordinate lines, and has constant normal component on each complementary 
y-line, with the flux varying linearly in the x-direction. 




Figure 3. Velocity basis function Vi+i/ 2 ,j on quadrilaterals. 



Let x,y € [0, 1] be the reference coordinates for a quadrilateral Q. Set 

^ (S'S) <-) 

^ (I'D O.S, 

to be the columns of the Jacobian matrix of the bilinear mapping. These can 
be viewed as the images of the vectors (1,0) and (0,1), respectively, under 
the mapping from Q to Q. For example, the in Fig. 3 is parallel to X, 

and after some manipulation, one can show that it is given on the left-hand 
quadrilateral Qij by 



i+l/2,ji^j y) 



Ji,j (^) y) 



where 



dx dy dx dy 
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is the Jacobian of the mapping from Q to Qij. In the right-hand quadrila- 
teral Qi+ij, everything is the same except that 1 — x and Ji+ij replace x 
and Jij, respectively. These velocity trial functions and unknowns (fluxes 
across edges) can also be obtained from those on rectangles by a Piola trans- 
formation m- Define also the corresponding unit normal vectors, rij, J_ Y, 
iij^ J_ X, pictured in Fig. 4: 



{dy/dy, -dx/dy) 
[{dx/dy)"^ + ’ 

{—dy/dx, dx/dx) 
[{dx/dx)'^ + {dy / dx)"^]^^"^ 



(3.11) 

(3.12) 




Figure 4. Control- volume mixed finite elements on quadrilaterals. 

It remains to choose control volumes and test functions. For the integra- 
tions of (E3), the control volumes are the quadrilateral blocks Qi^j, and the 
test functions are scalar characteristic functions of the control volumes, i.e., 
functions that are 1 on one volume and zero elsewhere. The Gauss divergence 
theorem then yields for quadrilaterals as well as rectangles. 

For the integrations of I2.5II . to mimic the steps leading to ini) , we 
use images of rectangular control volumes Qi+i/ 2 j and Qi j+ 1/2 under the 
bilinear mapping, as seen in Fig. 4. We again denote such control volumes 
by Qi+ij2,j and Qij+i/2- Qi+i/2,j in Fig- 4 will be the “tank” with pressures 
Pij and Pi+ij at the two ends. We seek test functions that allow the gradient 
of p to be integrated out, leaving differences of p. 

Let Qi+i/ 4 j and denote the “left-hand half” and “right-hand 

half,” respectively, of Qi+i/2,j- Then is the image of the right-hand 
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half, (1/2, 1) X (0, 1), of Q under the mapping to Qij. The original CVMFE 
0 used X as the test function, which led to integrals that could be calcula- 
ted analytically, with a factor of 1/J outside the integral; the modification 
suggested by Garanzha and Konshin 0 instead uses X/ J, which requires nu- 
merical integration but gains accuracy on highly distorted meshes. By their 
approach, the left-half p integral analogous to the one in ini) is 



/ Vp • (Xij/Jjj) dx = 

Qi + l/4,i 



0 Jl/2 



dp 

dx 



dx dy 



P{^,y)dy- / p{l/2,y)dy 

3 ^0 






(3.13) 



where the last step involves p at edge and cell centers and is exact for bili- 
near p. In defining the CVMFE equations, treat as exact, i.e., ignore 

truncation error. Similarly, the right-half integral yields 



' Qt+3/4,j 



Vp • (Xi+i j/ Ji+i j ) dx — Pi^ij Pi+l/2,j' 



Hence, by choosing the test vector field 






Xi,^/ di,j 

Xj^lj/ on 

0 elsewhere, 



(3.14) 



(3.15) 



we combine into 




2 



J 



Vp • w,+i/2j dx = Pj+ 1 j - pij, 



(3.16) 



and the edge value Pi+ij 2 ,j is eliminated. If desired, it can be recovered later 
in a postprocessing step. 

With the test function from (EUSl, the V term of (l?3ll can be calculated 
as in 0, leading to the discrete Darcy equation analogous to M4.2II . with edges 
denoted as in Fig. 1: 



0-i+l/2,j-,A{fx)i-l/2,j + 0>i^i/2J-D{fx)i+l/2,j + 0,i+l/2,j-,G{fx)i+3/2,j 

+ai-t-l/ 2 j;s(/y)ij + l /2 + ai+l/ 2 ,j-c{fv)i,j-l /2 

+0,i+l/2,j-,E{fy)i+l,j+l/2 + 0.i+l/2,j;G ifv) i+l,j-l/2 
TPi-t-lj' Pip — 



(3.17) 
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where the coefficients in (E33) are given by 



a 



i+l/2,j-,D 



[ [ (3.18) 

Jo Jl/2 

C ri/2 

/ / (l-i)(K 2+1 '^^+1)1 dtrdyj 

Jo Jo 



<^i+l/2,j-,A 


= f r (1 - i)(K-jX,,,) • dxdy, 

Jo Jl /2 


(3.19) 


«i+l/2,i;G 


C D/2 

~ 1 j / di+i,j dJ^ dy ^ 

Jo Jo 


(3.20) 


0'i+l/2,j\B 


~ 1 1 yi^i 1 dij dxdy^ 

Jo Jl /2 


(3.21) 


«i+l/2,i;C 


= f t (1 - y)(K-jY, J • dxdy, 

Jo Jl /2 


(3.22) 


Oi+l/2,i;E 


C D/2 

~ 1 1 y(.^i+l,j''^i+l,j) ’ ^i+l,j / di+l,j dx dy, 

Jo Jo 


(3.23) 


^i+l/2,j;F 


C /-1/2 

= / / iX ~ y)(^i+i j^i+ij) '^i+i,i / Ji+i,j dx dy. 

Jo Jo 


(3.24) 



The Darcy equation for the horizontal edge Tlij+i /2 is analogous, with the 
roles of the x- and ^-directions reversed. That equation, with and I Yjl . 
forms the discrete system for CVMFE. 

As the trial and test functions are different for CVMFE, the discrete 
equations may not be symmetric. A representative example is (I3-21I) . in 
which test function 'Wi+i/ 2 ,j interacts with trial function 'Vij+i/ 2 - Symmetry 
would require that yield the same result as 



^z,i+i/2;B — / / ^ i,j / di,j dx dy . (3.25) 

J 1 / 2 J 0 



Since and (13.2511 integrate over different (but overlapping) half-cells, 

symmetry cannot hold in general with variable J (with constant J, X, Y, 
as on a grid of parallelograms, symmetry does hold). However, with a qua- 
drature rule that receives no contribution from the nonoverlapping parts of 
the half-cells, i.e., one whose nonoverlapping points have y = 0 in (nnQ) and 
+; = 0 in (E2i, symmetry is possible. As noted by Garanzha and Konshin 
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0, for the half-cell in fl21l) . a rule of the form 




1 



U gdxdy 
/2 



r(5(l/2,0) + 2 g(l, 1 / 2 ) +5(1/2,!)) 



+ (1/8 - r)(5(l, 0) + 25(1/2, 1/2) + 5(1, 1)), (3.26) 



where 0 < r < 1/8, preserves symmetry and also integrates bilinear 5 exactly. 
With such a quadrature, the entire discrete system is symmetric, and for 
parallelograms all integrals are exact; in particular, the weights 1/8, 6/8, 1/8 
in (I3.3H ~ (I3.5I) are preserved. We note here that the corresponding weights 
for MFE are 1/6, 4/6, 1/6, and that MFE is symmetric with any consistent 
quadrature because the trial and test functions coincide. Because the MFE 
integrands that produce these weights are quadratic, it is possible within the 
MFE framework, with a rule that is not exact for quadratics, to produce 
the CVMFE weights and the CVMFE formulas resulting from quadrature 
rule (i;t.2till 0. In general, it is not possible to produce the exact CVMFE 
formulas in this way, as they are nonsymmetric. 

Quadrature provides a convenient framework within which to consider 
the technique of mass lumping, which is often used to simplify finite element 
methods, producing a diagonal mass matrix and reducibility to finite diffe- 
rences. In the context of CVMFE equations (tl. I iSj) -(I. S.24D . this is equivalent 
to setting x, 1 — x, y, and 1 — 5 equal to 1 whenever they are greater than 
1/2, and 0 otherwise, then integrating. In this would correspond to a 

quadrature with x = 1 and 5 = 0 or 1, i.e.. 



This quadrature preserves symmetry, just as (l.'1.2tiD does, but it does not in- 
tegrate bilinear 5 exactly; indeed, it must not, because it changes the weights 
1/8, 6/8, 1/8 in to 0,1,0. Garanzha and Konshin 0 (p. 24) note 

that this lumping produces the SO method 0E]j. Thus, CVMFE and SO 
are related in much the same way as CVFE and conventional point-centered 
finite differences. 




(3.27) 



4 Relationships to K Methods 

In the petroleum industry, there is considerable recent work 0 QEIIOI on 
discretizations that take a view dual to that of formulas such as itm . We 
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call the methods of Section 3 “K ^ methods” because, when complicated 
details are ignored, they reduce to a relationship of the form 

-K-if = Ap, (4.1) 

where f is the flux vector and Ap the pressure difference. K~^ appears 
discretely as a mass matrix, so that a combination of fluxes equals a pressure 
drop. “K methods,” on the other hand, adopt the perspective that 

f=-KAp, (4.2) 

expressing an individual discrete flux as a combination of pressure drops. As 
a prototype of these methods, we consider the MPFA scheme of Aavatsmark 
et al. pm and seek to relate K ^ methods to it. It seems natural to attempt 
to pass from (HP) to ( 14 . 21 ) by inverting K ^ . To avoid a full stencil in ( 14.21 
even for orthogonal grids, it is necessary to lump the principal weights in K“^, 
such as the 1/8, 6/8, 1/8 for CVMFE. For the contributions to caused 
by non-orthogonality and/or anisotropy, some form of incomplete inversion 
is suggested. 




Figure 5. Cells and interaction region for MPFA method. 

To describe the MPFA method, we refer to Fig. 5. The pressure p is to be 
determined at the cell centers and is assumed to be linear (not bilinear) on 
each piece I,II,III,IV of the interaction region. Thus, there are 12 degrees of 
freedom on that region, and 12 constraints are imposed: 4 cell-center values, 
continuity of p at the 4 points where the interaction-region boundary crosses 
a cell edge, and continuity of the flux across the 4 interfaces in the region. 
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To close this system, the relation 



Je = - [ KVp-ndS (4.3) 

J E 

between the pressure and the fluxes across edges is needed. With a bilinear 
reference mapping to a distorted quadrilateral, as in Section 3, this can be 
written 



where 



fx 

fv 



dp dp 



c 



dx dy’ 



(4.4) 



a = -J{KVx)-Vx = -{Y ■Y/J){Kn^)-n^, 
b = -J{KYy)-Yy=-(X-X/J){Kny)-ny, 
c = -J(KVx)-Vy=-(|X||Y|/J)(Kn,)-nj,. (4.5) 




This is best understood in a simple case, where a, 6, c are constants. In 
Fig. 6 we show a grid of parallelograms that meets this criterion, since J, X, 
Y are constants, and assume also that K = fc is a scalar constant. Referring 
to Fig. 6, solution of the 12 equations leads to P 

fo= (^a- ^ (P2-Pi) + | (l+ (P3-Ps)+^ (l- 0 {PA-P&)- (4.6) 

(For no distortion, c = 0, and (H.til) reduces to fn = a{p 2 — pi), as expected 
for finite differences.) If 9 is the angle of distortion (0 = 0 for rectangles. 
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X • Y = |X||Y| sin0), then J = |X||Y| cos 9, ■ ny = — sin0, and 

-Jk , -Jk , „ , X-Y 



X-Xcos2 6»’ 



b = 



whence 



Y- Ycos2 6i’ 
Jk 



“ 26 X-X’ 



c = k tan 9 = k 



c 1 X-Y 



J 



, (4.7) 



(4.8) 



Next, we evaluate the coefficients in the K ^ methods. For the configu- 
ration in Fig. 6, we write the analogue of (l?T7ll . 

o-aIa + ob/b + acfc + anfo + asfE + apfF + acfc + P 2 — Pi = 0, (4.9) 

where, by (lit. I 



an = 



6 fc-iX-X 

8 J ’ 



1 yfc-iX-X 



aA = ac = 



J 



1 fc-^Y-X 



as = ac = aE = ap = 



4 J - 

For MFE, the coefficients 6/8, 1/8 in l|4. 1 Djl become 4/6, 1/6; for lumping 
(SO), they are 1,0; for all of the methods, the 1/4 holds. As noted 
above, before inverting K“^, it is best to lump; doing so, and using 114. till . 



P2-Pi = 



yfc-iX-X 

J 



= a - 



26 



(/b + c(/b + fc + fp + fp)), 



{{I + eN)f)p, 



where 



IX- Y 



(4.11) 

(4.12) 

(4.13) 



fo= [a - — j (p2-Pl-e[(P4-Pl) + (P3-P2) + (Pl-P5) + (P2-P6)])- (4.14) 

The leading terms of dm and match. The terms of first order in c 

would match if 



4X-X 

and N is a matrix with I’s on 4 off-diagonals. Now, because 
(I + eN)-i =I-eN + 0(e2), 
we can perform an incomplete inversion and write 
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A calculation from (lOI) and era shows that actually 



c = — 4e I a — 



2b 



XX 

J 



(4.16) 



Thus, it could be of interest to study how these methods relate on significantly 
distorted grids with large aspect ratios. 

Arbogast et al. |2l Q| and Thomas and Trujillo m have developed other 
distorted-grid K methods derived from MFE formulations. The complexities 
are such that we can only give sketchy qualitative descriptions here. The ex- 
panded mixed (EM) method circumvents the difficulty of inverting by in- 
troducing an additional auxiliary variable, v = — Vp, and then incorporating 
the relation v = Kv into the weak form. The usual MFE for div(Kgrad), 
if reduced to a single equation for the pressure, leads to a discrete matrix 
N^M“^N, where the three factors essentially discretize div, K, and grad, 
respectively; M is sparse, but is full. The EM puts K on the right-hand 
side of the system, where its connectivity is sparse, in place of on the 
left. Low-order integration (midpoint and trapezoidal rules) leads to lum- 
ping and a compact stencil (9-point in 2-D, 19-point in 3-D) for use in finite 
difference codes. It seems likely that this scheme is related to lumped incom- 
pletely inverted methods, because such methods can obtain the same 
stencils in 2-D and 3-D, but this is speculative at this point. The method 
also uses Lagrange multipliers, representing edge pressures, where the grid 
is not smooth; another speculation is that these are related to the CVMFE 
edge pressure values that were eliminated in The aim of Thomas and 

Trujillo is a MFV formulation that admits both a discrete pressure and a 
discrete velocity that have the regularity of their physical counterparts, i.e., 
p G iJ^(n) and V G iJ(div,n). In the usual MFE approaches, v satisfies this 
criterion, but p does not. The discrete MFV pressure is continuous piecewise 
bilinear on a primal grid, with test functions that are constant on dual cells 
centered around the vertices of the primal cells. The trial and test functions 
for V are similar to those for the CVMFE method, but on a finer mesh. This 
appears to be substantially different from all of the other methods discussed 
here. 



5 Summary 

The challenge of constructing accurate discretization methods for heteroge- 
neous, anisotropic problems on distorted grids has given rise to a wide variety 
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of approaches. The ones considered here (MFE, CVMFE, SO, MPFA, EM, 
MFV) share the property of conserving mass locally on discrete cells. Thus, 
in some sense they are all finite-volume methods. Broadly, they can be cate- 
gorized as K or methods, depending on which way they formulate the 
conductivity coefficient. Typically a K method relates a flux to a combination 
of pressure differences, while a K~^ method relates a pressure difference to a 
combination of fluxes. Cell-centered finite difference and mixed finite element 
methods are standard representatives of these respective groups. Quadrature, 
mass lumping, and incomplete inversion of the mass matrix appear to be the 
concepts that can illuminate the interconnections between various schemes. 
Further study of these issues, particularly in 3-D, should be of significant 
benefit for practical modeling of flow in porous media. 
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Parallel Methods for Solving Time-Dependent 
Problems Using the Fourier-Laplace 
Transformation 

Dongwoo Sheen 



Abstract 

In this paper we summarize recent progresses on the parallel me- 
thod for solving time-dependent problems using the Fourier-Laplace 
transformation. These problems arise in the study of elastic wave 
equations with absorbing boundary conditions, for example. Instead 
of solving the time-dependent problems in the space-time domain, we 
solve them as follows. First, take the Fourier-Laplace transformation 
of given problems originally set in the space-time domain, and consider 
the corresponding problems in the space-frequency domain which form 
a set of indehnite, complex-valued elliptic problems. Such problems are 
solved in a natural parallel manner since each problem is independent 
of others. The Fourier-Laplace inversion formula will then recover the 
solution in the space-time domain. 



KEYWORDS: parallel method, Fourier-Laplace transform, parabolic and hy- 
perbolic problems 

1 Introduction 

A direction of solving in parallel a certain class of time-dependent problems 
will be discussed in this paper. Time-dependent problems are usually solved 
by using efficient time-marching algorithms, such as backward Euler, Crank- 
Nicolson, or any higher order methods. However, the nature of time marching 
essentially blocks the usual attempt to parallelize such algorithms along the 
time axis, since they require the knowledge of the solutions at previous time 
steps in order to advance to the next time step. The spirit of time mar- 
ching is in the sequential computing rather than in the parallel computing. 
Therefore, when time-marching algorithms are to apply for time-dependent 
problems, parallelization is naturally sought in solving space problems for 
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each time step using the solutions solved at previous time steps. In this di- 
rection, domain decomposition methods have been used in the past decade by 
decomposing the space domain into several subdomains and solving resulting 
elliptic problems restricted to subdomains for each time step. For such a di- 
rection, we refer PE][ziEio,[ii]iini and so on, as well as recent publications 
in major numerical analysis journals. However, these methods require heavy 
communication cost among processors in order to pass informations between 
neighboring subdomains. 

In 1993 and the subsequent year Douglas et al. glEUISI analyzed and 
solved acoustic and elastic wave equations in the space-frequency domain af- 
ter taking in time the Fourier transforms of the original space-time problems, 
and then the solutions in the time-space domain are obtained by the inversion 
formula. Since the frequency has no hierarchy, Fourier-transformed elliptic 
problems can be solved in parallel in arbitrary order without any data com- 
munication. This provides a basis for natural parallelism. After these works, 
with the aid of several coworkers the author has been developing the theory 
and extending its applications to viscoelasticity, parabolic problems, and li- 
nearized Navier-Stokes equations. See |Sl El 1201 El El El El- This 
paper essentially surveys in brief such approaches for solving time-dependent 
linear initial-boundary value problems via Fourier-Laplace transformation. 
However, we give a general setting for the description of problems and a 
unified approach to solve them. 

2 The Algorithm 

2.1 The Model Problem 

Let H be a Lipschitz domain in TZ'^, d = 2,3 and did its boundary. Set 
J = (0,oo). Consider the following problem: given / = f{x,t), g = g{x,t) 
uq = uq{x), and Ui = Ui{x), find u = u{x,t) such that 

a-^ +h— + Au= f{x,t), {x,t)€nxJ, 

I 

ac— + Bu = g{x,t), {x,t) G dit x J, (2.1) 

dll 

u{x,Q) = U(i{x), a( — (a:, 0) — ui(x)) = 0, xGil. 

Here, and in what follows, we assume that all the coefficients a, b, and c de- 
pend only on the space variable and are nonnegative, A is a time-independent. 
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symmetric, elliptic operator, and B is another time-independent, symmetric 
operator. Prototypes for A and B are the —A and where v denotes the 
unit outward normal to dfl. Also it will be assumed that f{x,t) = g{x,t) = 

0, t < 0. (2.1) covers parabolic and hyperbolic initial-boundary value pro- 
blems in the cases of a = 0 and a > 0, respectively. If a = 1,6 = 0,c = 

1, A= —A, and B = problem (2.1) describes the wave equation with the 
first-order absorbing boundary condition. 

In the sequel, problem (2.1) is assumed to be well-posed and numerical 
methods for solving it will be our main interests. 



2.2 The Transformed Problem 



Recall first that the Fourier-Laplace transform in time n(-, C)j C = cr iw = 
a{u>) iuj, of a real- valued function v{-,t) vanishing for 6 < 0 is defined by 



?'(-,C) 




e ‘'^v{-,t)dt, 



( 2 . 2 ) 



and if v is square integrable in t, that is \v{-,t)\‘^dt < oo, the inversion 
formula 



1 

27ri 



j^e‘^*v{-,C)dC= I 



0 , 



t > 0, 
t < 0, 



(2.3) 



holds, where P is a contour in the complex half-plane C with Re (c() > 0, 
and the integration is evaluated as Re C on the contour is increasing. If the 
integral in (2.2) converges for (^g = o-q -I- iujo, it converges absolutely and 
uniformly for all C with Re(C) > Re (Co)- The contour P will retain as a 
straght line (i.e., cr{co) is a constant function) if equation (2.1) is hyperbolic, 
and it can be deformed into the left half plane so that the spectrum of —A lies 
to the left side of the contour since the semigroup related with the parabolic 
problem will be analytic m- Let us restrict the contour P to be symmetric 
with respect to the real axis, and denote by P+ the upper half part of P lying 
above the real axis. Then, using ?;(•, C) = C)i the inversion formula (2.3) 
takes the simpler form: 



hm e«*v(-,C)dC=| 



v{-,t), 

0 , 



t > 0, 
t < 0. 



(2.4) 



Assume that u(x,-),ut(x,-), and autt(x,-) are square integrable in the 
second variable. Then, utilizing 



U((-, C) = Cm(-, C) - m(', 0), utt(-,0 = C) + (u(-, 0) - ut(-, 0), 
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problem (2.1) can be transformed into the following set of complex- valued 
elliptic problems for C on r_|_: find u = u(-,C) such that 

{aC + 6C + A)u{-, C) = /(•, C) - a(Cwo(-) - wi(-)) + buo{-) in fl, 

(2.5) 

(acC -I- C) = 'g{-, () — acuo(-) on dfl. 

Some remarks should be made on problem (2.5). 

Remark 2.1. In the hyperbolic case, that is a > 0, the above problem 
is of Helmholtz type. Thus if, in addition, acB = 0 and 6 = 0, it turns out 
to be an eigenvalue problem and existence is not guaranteed for arbitrary 
( = iw on the imaginary axis; otherwise, existence and uniquenss follow 
immediately. However, if acB ^ 0 for all x, solvability follows from the 
unique continuation principle and the Fredholm alternative. See for the 
acoustic wave problem and for the elastic wave case with absorbing 

boundary conditions. 

In the parabolic case, solvability is gauranteed with a Dirichlet, Neumann, 
or Robin boundary condition USmslEE!! 

Remark 2.2. The Fourier transformation (i.e, C = ioS) has been conside- 
red in pFn irariTi ri ^iTi)iimrri. iT2]. while Fourier-Laplace transformation 
has been applied in pij . 

2.3 The Fully-Discretization 

For C G r> tfie solution to problem (2.5) can be approximated by any of the 
finite element, finite difference, or finite volume method, which is denoted by 
Uh{-,C) so that it is a solution to the discrete problem: 

(aC^ + bC + Ah)u{-, C) = /(•, C) - a{Cuo{-) - ui{-)) + buo(-) in fl, ^ 
{ac( + Bh)ui,0 = ?(-,C) - ocMo(-) on dn, 

with Ah and Bh being suitable discrete approximations to A and B, respec- 
tively. 

Then the semidiscrete approximation Uh{-, t) to the solution of (2.1) can be 
obtained by using the inversion formula (2.4). We trun to the numerical eva- 
luation of the indefinite integral (2.4). For this, it may be useful to transform 
F+ into a finite contour by a smooth monotone function ip : (0,oo) — >■ [0,1] 
such that tp{0,oo) = [0, 1]. In PUj ip^Lu) = [g uggd with parameters r 

and q, which will be explained later. Also in = tanh(rw) is used. 

To evaluate the resulting integral on the transformed compact contour, we 
then apply standard composite quadrature rules of order q based on an N 
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uniform subdivision of the compact contour. Let yj,Wj,j = 0, • • • ,N be sui- 
table quadrature points and weights on [0, 1] of order q accuracy. Then, for 
the fully -discrete approximation Uh,N to the solution u{-,t) of (2.1), we have 

1 ^ 

Uh,N{;t) = -Im {■ ,a{i^~\y,)) + if^-\y,)) 

^ 3=0 

since 

Uh{-,t)= ^ J e‘^*Uh{-,0 d( 

= -Im / e‘’*Uh{-,z)dC 

TJ" Jr+ 

= — Im / (•, cr(w) -I- zw) {cr'(w) -I- i}dw 

Jo 

1 

{a'iij-^y)) + i} ^^^^{y)dy. 

Remark 2.3. Observe that Uh (•, (j{'tp~^{yj)) + itp~^{yj)) is independent 
of t. This is a crucial fact to our parallel method to be effective as for each j, 
Uh (•, cr('i/'“^(%)) -I- i'tf~^{yj)) can be computed as solutions to Problem (2.6) 
simultaneously. 

Remark 2.4. For efficient calculation of the approximation solution with 
the transformation f/'(w) = it is recommended to choose the parameter 

T such that t G (r, qr] where t is the time for the solution to be calculated. 

2.4 The Algorithm 

We summarize the algorithm in the following form. 

Let q be the accuracy of the quadrature rule on [0, 1], and choose appro- 
priate contour F+ and the transformation function ip. 

Algorithm 

Step 1. Take the Fourier-Laplace transformation of the given problem 
(2.1), and obtain the transformed problem (2.5). 

Step 2. Apply any spatial discreteization to (2.5) and solve the resulting 
problem (2.6) in parallel. 

Step 3. By the inversion formula (2.4), obtain the solution Uh^N- 
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3 Applications 

3.1 Parabolic Problems: Initial Value Problems 



We will abstract some results from m which will appear somewhere else. 
Consider the initial value problem: 



ut — Au = 0, for t > 0, with u(0) = uq. 



(3.1) 



Then solve the transformed complex-valued elliptic problems: 



Cu — Au = Uq, for Re > — 7 , 



for all C on a contour T = T.^ = {C = — 7 — wiiw; w > 0 } with Im C increasing 
from —00 to 00 . 

The solution u(t) of (3.1) is then obtained by the inversion formula after 
the application of the transformation tp{Lu) = 

The following error analysis has been obtained in EOl. 

Theorem 3.1 Let UN(t) denote the semidiserete approximation to the solu- 
tion u(t) of the homogeneous inital value problem obtained using the above 
method, then there exists C = C{Xq — 7 ) > 0, sueh that 



where 0 < t < t in the last one. 

The fully-discretized error — uat|| is then the usual finite element, 

or finite difference error plus the error in the above theorem. 

3.2 Parabolic Problems: Inhomogeneous Problems 

We consider the initial value problem: 

Ut — Au = /, for t > 0, with u(0) = 0. 

Then solve the transformed complex- valued elliptic problems: 



||uiv(t)-u(t)|| < C'||uo||e < 




(^u — Au = / 
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for all C on a contour F = = {(^ = ioj] uj > 0}, which is the ususal Fourier 

transformation taken. 

Again the solution u{t) is obtained by the Fourier inversion formula. 
u{t) = f —A)fdC = — Im / e‘’*it(C) dC for t > 0. 

ZTTi 7r Jr+ 

For more details of analysis concerning this approach, we refer to for 

instance. 



3.3 Hyperbolic Problems with Absorbing Boundary Con- 
ditions 



We consider the initial value problem: 

utt - Au = f, 
du 

Ut + — 0 , 

ou 

w(0) = Ut(0) = 0; 



n X (0, oo), 
dfl X (0, oo), 

n. 



Take the Fourier transform ((^ = iw) to the above equation to get 

— w^it + Au = /, n, 

(9?y 

ZWU+ — = 0, dfl, 

ou 



for all UJ. 

Again the solution u{t) is obtained by the Fourier inversion formula 
u{t) = — Im [ e‘’*u(() d( for t > 0, 

7T Jr+ 

For this approach, see iioj. 



3.4 Other Applications 

For the elastic wave equations with absorbing boundary conditions, the above 
approach with Fourier transformation has been applied unj. The method of 
applying Fourier transformation has been used to treat viscoelastic problems 
in iniini, and linearized Navier-Stokes equations in m- Also, an application 
to solve certain class of semilinear parabolic problems is now in progress by 
Ganesh and the author 0. 
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Cascadic Multigrid Methods for 
Parabolic Pressure Problems 

Zhong-Ci Shi Xuejun Xu 



Abstract 

In this paper we develop the cascadic multigrid method for para- 
bolic problems, which arise as the pressure equatious for the flow of 
compressible fluids in porous media. The optimal convergence accu- 
racy and computation complexity are obtained. 



KEYWORDS: cascadic multigrid, finite element, parabolic problem 

1 Introduction 

Bornemann and Deuflhard 001 have presented a new type of multigrid 
method, the so-called cascadic multigrid. Compared with usual multigrid 
methods, it requires no coarse grid corrections at all that may be viewed 
as a “one way” multigrid. Another distinctive feature is that it performs 
more iterations on coarser levels so as to obtain less iterations on finer levels. 
Numerical experiments show that this method is very effective for second 
order elliptic problems. 

In this paper we consider the cascadic multigrid for parabolic problems, 
which arise as the pressure equations for the flow of compressible fluids in 
porous media. Here we must treat the effect of discrete time steps. As pointed 
out in for a small time step r < 0{h?), where h is the space mesh size, 
some standard iterative methods, like the Richardson iteration, can guarantee 
a good convergence for the discrete system. But for a relative large time step 
r, recommended multigrid methods; see 0 for details. Now, we consider 
to use the cascadic multigrid. Similar to second order elliptic problems, 
it is proved that the cascadic multigrid with the conjugate gradient (CG) 
iteration as a smoother is accurate with the optimal complexity in 2D and 
3D and nearly optimal in ID. As for other traditional iterative methods, 
like the Richardson iteration, the cascadic multigrid still yields the optimal 
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accuracy and complexity in 2D and 3D and in a certain case of ID. Notice 
that for the second order elliptic problem, the cascadic multigrid with these 
iterative methods gives the optimal accuracy and computation complexity 
only in 3D and nearly optimal in 2D. They cannot be used for ID. 



2 Model Problem and Its Finite Element 



Consider the parabolic problem: Find u{x, t) such that 
3v 

— + Cu = f in fix [0,T], 

u{x,t) = 0 in i9nx[0,T], (2-1) 

u(x,0) = Mo(a;), 



where ft C R‘^ {d = 1,2,3) is a bounded domain, / G and £ is an 

elliptic operator 



r \ " ^ ! i 



dxi dxj 

1,3 = 1 



Here aij{x) satisfies 



^ Vx G G 

i,t=l 



where c and C are positive constants. The variational form of (2.1) is to find 
u G Hq{Q), u{x,0) = wo(x) such that 



du 



v) + B{u, v) 



if,v) yveH^in), tG[0,T], 



where 



. ^ dudv 

(/> = In 



Vm, V G 



We use the backward Euler scheme and Crank-Nicolson scheme for the time 
discretization 0. Both schemes are absolutely stable 0. Let be the nth 
time step and M the number of steps. Then = T. Consider the 

problem: For a given function gn-i G find w G i?o(fl) such that 



A^{w,v)=t ^{w,v) + B{w,v) = {gn-i,v) Vu G iLo(fl), (2.2) 
where r is the time step parameter. For the backward Euler scheme, we have 
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and for the Crank-Nicolson scheme, 

w = T = Atn/2, (gn-i,v) = 2{{f,v) - B{u'^~'^,v)). 

Now we define the r-norm by 

WvW"^^ = T~^{y,v) + B{v,v) V?;ei7o(ri). 

Let L; (Z > 0) be a quasiuniform triangular partition of with the mesh 

size hi = ho2~K L; is obtained by linking the midpoints of three edges of 
triangles on Lj-i. We assume that = Uk^FiK. Let Vi denote the P\- 
conforming finite element space on L;. Then we obtain the discrete form of 
(2.2): Find ui € Vi such that 

Ar{ui,vi) = (g,vi) yvi€Vi- (2.3) 

Define the operator : V/ — )> V) by 

{Ai^rUl,Vl) = Ar{ui,Vl) Vui,Vl € V). 

Then (2.3) can be expressed by 

Ai^rUi=gu (2.4) 

where gi £ Vi, {gi, v) = {g, v) v € V). 

3 Cascadic Multigrid Method 

We use the cascadic multigrid to solve (2.4) at each time step. Define the 
cascadic algorithm for (2.4) as follows: (1) set Ug = Mq = ug and let = u*_^; 
(2) for Z = 1, . . . , L, set (3) set u* = u^\ where Ci^r denotes 

the Richardson iteration procedure, i.e., 

- U?) = {I- Rl,rAr){ui ~ U°). 

Here R^r = (A; + and A; = 0{hj‘^). 

Following j2|, we call a cascadic multigrid method optimal on the level L 
if we obtain both the accuracy 

\Wl - uIWt ~ \\u-ul\\t, 

which means that the iterative error is comparable to the approximation 
error, and the multigrid complexity 



amount of work = 0{nif) 
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where ul = dimVL- Note that hi = K 

Consider sequences mi, m 2 , . . . , of the kind 

mi = (3.1) 

for some fixed /3 > 0, where [•] means the choosing integral function. If r 
satisfies that r < based on the observation in we know that some 
usual iterative methods, like the Richadson iteration, can already guarantee 
good convergence. Therefore, we only consider the case where t > . In 

such case, for any fixed r, we can find a positive constant 0 < 70 < 1 which 
satisfies 

^ < AZV70, (3.2) 

where 70 is dependent of r. 

Theorem 3.1. The accuracy of the cascadic multigrid with the Richard- 
son iteration for the parabolic problem is 

\\uL-ul\\r <C^~ ^ II5II0 for fd> { )^ 

m 2 I 1 J. + 70 

2 (1+70) 

where (3 and mL are defined in (3.1) and r is in (3.2). 

According to Lemma 1.4 in 0, we have 

Lemma 3.1. The computational cost of the cascadic multigrid is propor- 
tional to 

'^mini<C ^mLUL for (3 <2'^. 

Theorem 3.1 indicates that a large (3 can yield an optimal accuracy. Me- 
anwhile, Lemma 3.1 shows that the optimal complexity of the method can 
be achieved only for a small /3. Therefore, we have 
Theorem 3.2. If (3 in (3.1) satisfies 

(^—f</3<2‘^, d= 1,2,3, 

1 + 7o 

then both the optimal accuracy and complexity of the cascadic multigrid with 
the Ricchardson iteration can be obtained. 

Remark 3.1. From Theorem 3.2, it is seen that the cascadic multigrid 
with the Richardson iteration gives the optimal accuracy and complexity for 
2D and 3D parabolic problems. But for ID problem, it requires that the 
parameter f3 must be chosen to satisfy 



(2/(1 + 7o))"</3<2, 
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which turns out that the value 70 in (3.2) should be greater than 2^/^ — 1 
that prevents choices of a relatively large time step parameter t, say of order 
h in the Crank-Nicolson scheme. 

Remark 3.2. Compared with the parabolic case, for 3D elliptic pro- 
blems, the cascadic multigrid with the Richardson iteration gives the opti- 
mal accuracy and complexity. But for the problem in 2D, it gives only nearly 
optimal complexity. It cannot be used for ID elliptic problems at all 121 IZ|. 

4 Conjugate Gradient Method 

Assume that is an initial value of the CG method on the level 1 . Let 
be the m; steps of the CG iteration. Then the error of the CG method can 
be expressed by 



\\ui - C]^^U°i\\r = ^miii \\p{Ai^^){ui-v°i)\\r, 

pe^m, .p(0) = l 

where Pm, denotes the set of polynomials p with degree < mi [3] . 

Using a same argument of Theorem 2.2 in P|, we have 

Lemma 4.1. There exists a linear operator Ti^r = , where 

€ Pm and i^A,m(0) = 1, such that 

\\TrVl\\r<\\vi\\r ^ViGVi. 

Using Lemma 4.1 and following the same line of Lemma 1.3 as in |2|, we 
have 

Lemma 4.2. Assume that the time step parameter r > 0{h\). Then 
the accuracy of the cascadic multigrid with the CG method as smoother is 

CL^II^IIo for (3 = 2. 

(. rriL 

Remark 4.1. It should be noticed that the assumption on the time 
step parameter r > O(hjf) in Lemma 4.2 is not a real restriction since we can 
always assume t = 0{h\) for the backward Euler scheme and r = 0{h]f) 
for the Crank-Nicolson Scheme. Moreover, as pointed out in for a small 
time step parameter r < 0{h\), some standard iterative methods alone are 
efficient enough to guarantee a good convergence. 
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Combining Lemma 4.2 with Lemma 1.4 in 0, we get 
Theorem 4.1. (1 ) For 2D and 3D parabolic problems, the optimal accu- 
racy and complexity can be obtained for the cascadic multigrid with the CG 
iteration. (2) For ID parabolic problems, if we choose (3 = 2 and the number 
of iterations on the level L is 



rriL = [m,L], 

then the error of the cascadic multigrid method is 

\\uL-ul\\r<C^\\g\\o 

rrit 

and the complexity of computation is 

L 

mini < cm^nL{f + lognL)"^. 

1=1 



Remark 4.2. Besides the P\ conforming finite element, the above results 
are also valid for other conforming or nonconforming finite elements of the 
second order problem (see |3)- 

Remark 4.3. In practical computation, the right hand term gi in (2.4) is 
related to the cascadic multigrid solution of the last time step. According to 
0, the backward Euler and Crank-Nicolson scheme are absolutely stable, so 
the small perturbation of right hand term in (2.4) still assure the efficiency 
of our algorithm. 

Acknowledgments. The authors thank Dr Mo Mu and Dr Qiang Du of 
Hong Kong University of Science and Technology for useful discussions. 
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Estimation in the Presence of Outliers: 
The Capillary Pressure Case 

Sam Subbey Jan-Erik Nordtvedt 



Abstract 

The inversion of laboratory centrifuge data to obtain capillary pres- 
sure functions in petroleum science leads to a Volterra integral equation 
of the first kind with a right-hand side defined by a set of discrete data. 
The problem is ill-posed in the sense of Hadamard |^. The discrete 
data lead to a discretized equation of the form 

Ac = b + e, 

where b represents the observation vector, A is an ill-conditioned matrix 
derived from the forward problem, c is the coefficients in a representa- 
tion of the inverse capillary function, i.e., parameters to be determined, 
and e is the error vector associated with 6. If e* ~ N{0, and satis- 
fies the Gauss-Markov (G-M) conditions, then an estimate, c\, of c is 
BLUE 0. In the presence of outliers, the G-M conditions and/or the 
normality assumption can be violated. 

In this paper we parameterize the capillary pressure function using 
B-splines and address the issue of ill-posedness by reformulating the 
problem as a constrained optimization task involving the determination 
of the spline coefficients. By the nature of the experimental procedure, 
we expect the G-M conditions to be satisfied. A systematic method of 
outlier elimination and a choice of knots is employed to ensure satisfac- 
tion of the normality assumption and thereby derive capillary pressure 
curves to a high degree of accuracy. A robust method for estimating 
the solution curve, which accommodates both outliers and influential 
points, namely the Li-norm solution, is also presented. The method is 
demonstrated on synthetic data. 



KEYWORDS: Volterra, outliers, ill-posed, regularization, capillary pressure, 
estimation 
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1 Introduction 



The problem of determining capillary pressure functions from centrifuge data 
leads to an integral equation of the form 



K(x,t)f{t)dt = g{x), x€[a,b], 



( 1 . 1 ) 



where the kernel K is known exactly and given by the underlying mathema- 
tical model, g is only known with a certain degree of accuracy in a finite set 
of points , Xm, and the sought function f{t) is however continuous. By 

the nature of the right-hand side, g{x), eq. (1.1) is a discrete inverse problem 
which is ill-posed in the sense of Hadamard 0 . By a parameterization of the 
sought function, eq. (1.1) reduces to a system of linear equations of the form 



Ac = b + e, 



( 1 . 2 ) 



where b is the observation vector, A arises from discretization of the forward 
problem, e is the error vector associated with b, and c contains the model 
parameters. The matrix A is usually ill-conditioned. The ill-conditioning is 
closely connected to the parameterization of the probelm. As the dimension 
of the parameters increases, the spectra of A become increasingly dominated 
by very low singular values. Thus the condition of A increases with the 
increasing parameter dimension. 

The usual assumption is that e satisfies the following conditions: 



E(ci) = 0, F,{ef)=a'^, E(ei,ej) = 0 when i^j. (1.3) 



Eqs. (1.3) above are the Gauss-Markov (G-M) conditions. To be able to 
choose between the feasible vectors c, we introduce a merit number to define 
the goodness of fit associated with a feasible vector c. Suppose we have M 
observations; we form the vector 5 G by setting 



Si = bi — QrC 1=1,..., M, (1-4) 

where Or refers to the row of the matrix A and Si defines the residual 
for data point i, i.e., the difference between the measured values bi and the 
calculated value. 

We first make the assumption that the Ci’s are normally distributed. If 
we maintain this assumption and in addition assume that the G-M conditi- 
ons hold, then the residuals Si’s are independently and normally distributed 
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Figure 1: 

Figure 1: Schematic diagram of the centrifuge system. 



with zero mean and a variance of i.e., Si ~ N{0,a'^), Pj. If this is the 
case, then the least square (F 2 ) solution of eq. (1.2) is theoretically optimal, 
i.e., has the least variance. In fact, it is the best linear unbiased estimate 
(BLUE) P|. In practice, it is frequently assumed that experimental errors 
are normally distributed. This and the fact that of all Lp-norms only the L 2 - 
norm approach is linear account in part for the popular use of the L 2 -norm 
solution approach. 

A single point or a small number of points may violate either or both 
of our basic assumptions of normality and G-M conditions. A solution in 
L 2 “Horm will then not be theoretically optimal. These points which violate 
one or all of the G-M conditions and/or the normality assumption are termed 
outliers. In essence, we define an outlier as an observation inconsistent with 
the assumed model of the random process generating the observations. 

In this paper, by the nature of the experimental process, we expect the 
G-M conditions to be satisfied. We concern ourselves with the case when the 
normality assumptions are violated. We demonstrate a method for testing 
small sample data for the normality assumption and show how a systematic 
outlier detection and elimination can be employed to obtain the satisfac- 
tion of the normality condition and thereby a solution in the L 2 -norm. We 
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demonstrate this on the ill-posed problem of determining capillary pressure 
functions from centrifuge data. We show how capillary pressure curves can 
be estimated by reformulation of the problem as a constrained optimization 
task where the constraints are dictated by the physics of the problem. 

In a situation where the outlier accommodation rather than elimination 
is required, we present a robust method for estimating the capillary pressure 
functions namely the Li-norm solution. Here we treat only drainage capillary 
pressure curves. 



2 The Physical Problem 

From Fig. 1, the pressure at a radius r, Vi < r < re, is given by 

Pi{r,u) = - r^)/2. (2.1) 



By assumption, the capillary pressure Pc = Pnw — Pw = 0 at r = rg. Hence 
Pc{ri,uj) = ^Apuj^{rl - rf). (2.2) 

The average saturation of the porous medium is defined as 

S = [ S{r)dr. (2.3) 

re - Ti Jr, 



We substitute eq. (2.2) into eq. (2.3), and by a change of variables arrive at 
eq. (2.4) that relates the average core saturation, S, measured at the outlet, 
the capillary pressure, Pc{uj) and the sought point-saturation along the core, 
S. Thus, based on experimental data we need to invert: 



S{Pc{ri,(jj)) = 



fPc(ri,uj) 



K{Pe{n,uj),Pe{co))S{Pe{u;))dPe{u;), (2.4) 



where 



K{Pc{ri,uj),Pc{u})) 



1-g n 
re 

2Pc{ri,uj) 



1 



Pe{ri,uj) 



Eq. (2.4) above is a Volterra integral equation of the first kind. It is ill-posed 
and has numerical instabilities. Higher order numerical methods will there- 
fore diverge. The equation may be reformulated into the Volterra equation of 
the second kind which is more stable but this is not recommended as this will 
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involve the numerical differentiation of data. The equation must therefore 
be regularized. Among the methods of regularization which have proven be 
effective in solving eq. (2.4) are the Tikhonov regularization and the regula- 
rization by convexity constraints m- In this paper regularization is carried 
out in two stages. A pre-regularization by adequate paramterization to sa- 
tisfy the normality condition is followed by regularization by imposition of 
convexity constraints. 

3 Problem Formulation 

We parameterize eq. (2.4) above by representing S{Pc{uj)) by an m*^ order 
B-Spline as 

AT 

S{P,{u;)) = Y,c,Np{P,{u;),^. (3.1) 

f=i 

Here are the normalized B-spline basis functions, y is the spline partition, 
the Cj’s are the spline coefficients, and N is the number of the basis functions, 
otherwise refered to as the spline dimension. Substitute eq. (3.1) into eq. (2.4) 
to arrive at 

_ ^ fPcOi,^^) 

S{Pc{r,,uj)) = y^c, N^{Pciw)J)K{P,{r,,uj),P,iu;))dP,{u;). 

We split the integral above into two parts; from the zero capillary pressure 
to the displacement threshold pressure Pcd and from P^d to the capillary 
pressure at the inner face of the core. We know that S{Pc{ui)) = 1, for 
0 < Pc{^) < Pcd, and define 

pPcd 

f{Pcd,uj)= / K{Pc{r„tu),Pc{u;))dPc{u;). (3.2) 

Jo 

Thus we have 

S{Pc(ri,uj)) - f{Pcd,uj) 

D ^ (3-3) 

= Ef=i c, Np{Pc{co),^K{Pc{r,, co),Pc{u;))dPc{u;). 

Pcd enters into eq. (3.3) nonlinearly through f (Pcd, to) in eq. (3.3) and must 
be determined using the experimental data. A zero order search is per- 
formed within the range 0 < Pcd < Pc{oj)\-g^p optimal va- 
lue. Pc{uj)\g^p stands for the first experimental capillary pressure 



304 



Subbey and Nordtvedt 



for which the average water saturation is less than unity. Here we for sim- 
plicity assume that Pd is known. The Pd problem can be easily handled as 
outlined above; for a more complete analysis; see cni. S{Pc{ri,to)) has been 
measured at M points (Pc{ujk), k = ) and the saturation values 

{S{Pc{ri,0Jk)), k = 1, . . . , M) have been obtained in the form of readings. 
We therefore write the equation above as, fc = 1, . . . , M, 

S{Pc{ri,uJk)) - f{Pd,uJk) 

= Ef=i c, N^{P,{uj),y)K{P,{n, u;k),Pciu;))dP,iu;). 

We note that eq. (3.4) constitutes a system of linear equations of the form 

b = Ac, (3.5) 

where c contains the spline coefficients and the elements in the matrix A are 
given by the integral over the basis functions. We define 

bi = S{Pc{ri,uji)) - f{Pd,0Ji), 

= /^j"^’‘"’^iV™(P,(o;),y)i^(P,(n,o.,),-Pc(^))dPc(o^), 

i = 1, . . . , M, j = 

By the physics of the problem we require that the solution curve satisfy 
monotonicity and convexity constraints m- Monotonicity translates into 
demanding that the ^coefficients of our splines are ordered according to 

1 > Cl > C2 > C3 ...> Cat > 0. 

For m = 3, the above is a necessary and sufficient condition for monotonicity 
|7] and can be expressed in matrix form as 

Ec> F, 



-1 


0 


0 ... 


0 


0 


0 








1 


-1 


0 ... 


0 


0 


0 




Cl 




■ -1 ■ 


0 


1 


-1 0 




0 


0 




C 2 


> 


0 


0 


0 


0 ... 


0 


1 


-1 




Cat-1 




0 

0 
















Cat 




0 


0 


0 ... 


0 


0 


1 









By convexity constraints, we require that the second derivative of the ca- 
pillary pressure function be positive. The second derivative of a B-spline 
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expressed as a third order polynomial is 



dP (u>) (^) + bijPc{oj) + cij) — 2aij. 



The coefficients aij are known since they have to be derived during the 
formation of matrix A. Our convexity constraint translates into a matrix 
equation of the form 



where H is an N — 2 column vector with zero elements and G is a (TV — 2 x N) 
matrix defined as 



Based on our assumptions that the nature of our experimental procedure 
guarantees a satisfaction of the G-M conditions, if we further assume that 
the residuals resulting from the solution of eq. (3.5) are normally distributed, 
then a solution in the L 2 -norm, expressed through eq. (3.6), is the BLUE of 
c. However, due to the ill-conditioned nature of the system, we still need to 
regularize the system by introducing monotonicity and convexity constraints 
in the form of eq. (3.7). Hence the task is reduced to 



subject to 



4 Checking the Normality Assumption 

We adopt two methods of test, namely an approximate rankit plot and a 
Shapiro- Wilk normality statistic check. We discuss them briefly in this sec- 
tion. 

4.1 Rankit Plot 

If (5(1) < (5(2) < . . . < S(^M) are ordered values of M independent and identi- 
cally distributed random variables, then (see 0) 



Gc> H 




min^llfe — Ac\\^ 



(3.6) 



E[<^h)] ~ M + 7i, 7i = o--4> ^[(i-3/8)/(M-kl/4), 4>(a;) = -^= 



/ 



'X 




— oo 
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From this equation a plot of c5(i) against ji, where are the ordered values 
of Si, can give an approximately straight line. In that case the residuals 
could be taken to be approximately normally distributed and the errors to 
be approximately normal. 

4.2 Shapiro-Wilk Test 

If we assume the same distribution for S(i-) as above and set 

M 1 ^ 

^ (M-1) 

^ ^ 2=1 2=1 
then the Shapiro-Wilk test statistic is given by 

M 

W = J2a^S^^)/s, 0<W<1, 

i=l 

where oi , . . . , om depend on the expected values of the order statistic from 
a standard normal distribution and are tabulated in Shapiro-Wilk [^. We 
reject the null hypothesis of normality if W < Wa, where Wa is a tabulated 
critical value, also given in jSj. Values of W close to 1 indicate near-normality. 
Based on the rankit-plot and the Shapiro-Wilk statistic, we establish the fol- 
lowing algorithm, where is the Shapiro-Wilk statistic for spline dimension 
j (the spline order is held constant). 



Algorithm 1 (Shapiro-Wilk and rankit plot: L 2 -norm) 



Initialize: swap=0, dim=3 




1. for K = dim step 1 do 




w 


= swap 


if 


< w then goto step 2 


swap 


= 



2. if ^ > Wa such that P > 90%, accept null hypothesis. Adopt model 

with , else 

3. Inspect rankit plot for model with VF” and eliminate obvious outliers. 
Goto initialize. 

5 Validation-Synthetic Model 

Verification of the algorithm on synthetic data is attractive as our estimation 
results can be compared to the true capillary pressure curve. We use a 
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modified form of the parameterized capillary pressure function by Bentsen 
and Anli fP in the following form: 



Pc = Pcd 



<? — S' 

1 Sini 



- 1/0 



The integral in eq. (2.4) is then given by 

^wi Pi _j_ / ^Pcd 

I - Swi L L 



Ll = Te 




_ 2P,d \ 



" dx 

[(2-a:)xp’ 



(5.1) 



RANKrr PI.OT.(W^ t*ai ) 
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Figure 2: Rankit plot & cap. pressure curve-stage 1. 




Figure 3: Rankit plot & cap. pressure curve-stage 6. 



Eq. (5.1) can be solved numerically for Sw{uji). In the particular cases 
shown here, we have used the parameters in this presentation as Pcd = 5kPa, 
Swi = 0.1, L = 5cm, Ve = 9.38cm, Ap = Igjcm 3, and 0 = 1. For 15 different 
angular velocities, we calculate the average saturation using eq. (5.1). (In 



308 



Subbey and Nordtvedt 



fact, for these parameter values, the integral is analytical). We generate data 
for two cases. To include outliers we perturb the synthetic data using the 



relation 



-^perturbed 



(co,) = 1.2 sT 



-^perturbed 



(u;,) = O.SST 



{iv,), j = 5,10, 

{ujj), j = 6, 12. 



(5.2) 



All 15 data points including those perturbed above are refered to as partially 
perturbed data (pp-data). We add white noise using the relation 



(wj) = S. 



fpp—data 



{ujj){l + 0.02es,j), j = l,...,15. 



esj is drawn from a normal distribution with zero mean and unit standard 
deviation. This constitutes our first case, case 1. For case 2, we also perturb 
the last data point, 15, according to eq. (5.2). 



6 Results and Discussions 



Stage 


N 


max.VF 


P 


min. W 


M 


1 


6 


0.877 


2%<P< 1% 


0.821 


15 


2 


6 


0.960 


50% <P < 90% 


0.935 


13 


3 


6 


0.961 


50% <P< 90% 


0.928 


12 


4 


6 


0.952 


50% <P < 90% 


0.838 


10 


5 


6 


0.966 


50% <P < 90% 


0.928 


8 


6 


7 


0.975 


P = 90% 




7 



Table 1: Algorithm 1-case 1. 



In the first approach, the problem involing case 1 is solved by using a 
spline dimension of 6 to 7, which appears to give sufficient flexibility in the 
calculated capillary pressure curve. Regularization is achieved by the im- 
position of convexity constraints on the solution curve, and Algorithm 1 is 
employed to achieve compliance with the normality assumption. 

TableO shows important data at the various stages of application of Algo- 
rithm 1. Tabulated are the spline dimension, N, the maximum Shapiro-Wilk 
statistic obtained with the corresponding percentage level, P, the minimum 
Shapiro-Wilk statistic, and the number of experimental data employed at the 
given stage, M. 

Fig. 2 shows the rankit plots as well as the estimated capillary pressure 
curves for stage 1 of Algorithm 1. The plot order of the residuals at W = 
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0.821 is 6 q < Si 2 K Si K S 3 <t (5 i 3 < <5i5 < S 14 < ^2 < 1^11 < < 1^7 < c5g < 

(5io < <54. Inspection of the rankit plot at stage 1 shows that the 6th and 12th 
data points (Jg, S 12 ) deviate considerably from any straight line about which 
majority of the residuals have minimum deviation. These points could be 
flagged out as possible outliers. 

Fig. 3 shows the rankit plot as well as the estimated capillary pressure 
curve at stage 6 of Algorithm 1. At this point, we can state, at 90% level, 
that our residuals and errors are normally distributed (refer to Table 0 • We 
see that the estimated capillary pressure curve to a high degree reconciles 
the true one. From experience, a confidence level > 90% has been found to 
be adequate. 

Case 2 presents a unique situation where the last data point (15) is an 
outlier. Algorithm 1 could still be applied to this case. Indeed, at stage 5 of 
Algorithm 1, experimental data number 15 could be flagged as a possible ou- 
tlier. However, from a practical point of view, it will be unwise to delete data 
number 15 since we would then be unable to estimate the capillary pressure 
curve in the range between the 14th and 15th average saturation values. We 
therefore need a robust norm definition for the residuals which, when applied 
in combination with our monotonicity and convexity constraints, accommo- 
dates the outlier. This we do by the Li-norm expressed as 

min^y^^ \b — Ac|. (6.1) 

Thus we use a low-dimensional spline {N = 5) and solve eq. (6.1) subject 
to the constraints in eq. (3.7) to arrive at Fig. 2. Indeed, for this case, 
we adopted the spline dimension with the highest W in stage 1. Thus the 
solution curve represents that which is nearest to normality in stage 1. From 
Fig. 2, we realize that we are able to estimate the capillary presssure curve 
to a fairly high degree, accommodating all outliers. 

7 Conclusions 

1. We have developed an algorithm for detecting outliers based on viola- 
tion of the normality assumption. 

2. Outliers can be handled through Algorithm 1. 

3. In the case where the last data point is an outlier, a solution in the 
Li-norm is proposed, which accommodates the outlier. 
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Figure 4: Case 2 cap. pressure curve-Li-norm. 



References 

[1] Bentsen, R. G. and Anli, J., Using parameter sstimation technique to 
convert centrifuge data into a capillary pressure curve, SPEJ, Feb. 1977, 
57-71. 

[2] Blom, G., Statistical Estimates and Transformed Beta Variates, New 
York Wiley, 1958. 

[3] Gustafson, S.-A, Regularizing a Volterra integral equation problem by 
means of convex programming, Working Paper Nr. 138, 1991. 

[4] Hadamard, J., Lectures on the Cauchy Problem in Linear Partial Diffe- 
rential Equations, Yale University Press, 1923. 

[5] Hassler, G. L. and Brunner, E., Measurements of sapillary pressures in 
small core samples, Trans. AIME 160 (1945), 114-121. 

[6] Madansky, A., Prescription for Working Statisticians , Springer-Verlag, 
1988. 

[7] Schumaker, L. L., Spline Functions: Basic Theory, J. Wiley & Sons, 
New York Gity, 1981. 

[8] Shapiro, S. S. and Wilk, M. B., An analysis of variance test for normality 
(complete samples), Biometrika 52 (1965), 519-611. 

[9] Srivasta, S. and Sen, A., Regression Analysis-Theory, Methods, and Ap- 
plication, Springer-Verlag, 1990. 

[10] Subbey, S. and Nordtvedt, J.-E., Estimating sapillary pressure functions 
from centrifuge data, submitted to AIGhe. 





A Comparison of ELLAM with ENO/WENO 
Schemes for Linear Transport Equations 
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Abstract 

We present an Eulerian-Lagrangian localized adjoint method (EL- 
LAM) for linear advection-reaction partial differential equations in 
multiple space dimensions. We carry out numerical experiments to 
compare the performance of the ELLAM scheme with the essentially 
non-oscillatory (ENO) schemes and weighted essentially non-oscillatory 
(WENO) schemes, which shows that the ELLAM scheme outperforms 
ENO and WENO schemes in the context of linear transport PDEs. 



KEYWORDS: advection-reaction equations, characteristic methods, compa- 
rison of numerical methods, essentially non-oscillatory schemes, Eulerian- 
Lagrangian methods, transport equations 

1 Introduction 

Advection-dominated partial differential equations (PDEs) describe the dis- 
placement of oil by injected fluid in petroleum recovery, the subsurface con- 
taminant transport and remediation, and many other applications 0E11SI. 
Because of the moving steep fronts present in their solutions, the numerical 
treatment of these PDEs often presents severe difficulties. Standard finite 
difference or finite element methods (EDMs, FEMs) tend to generate soluti- 
ons with severe non-physical oscillations. While classical upwind EDM could 
eliminate these oscillations, they yield solutions with excessive smearing and 
potentially spurious effects related to the orientation of the grid. Two general 
classes of improved approximations can be identified from the literature: up- 
wind methods that use fixed spatial grids with some form of upwinding and 
the standard temporal discretization, and the characteristic methods that 
carry out the temporal discretization by characteristic tracking. 

The Eulerian-Lagrangian localized adjoint method (ELLAM) |n| was in- 
troduced by Celia, Russell, Herrera, and Ewing in solving (one-dimensional 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 311-^^^ 2000. 

(c) Springer-Verlag Berlin Heidelberg 2000 
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constant-coefficient) advection-diffusion PDEs. The ELLAM methodology 
provides a general characteristic solution procedure and a consistent frame- 
work for treating general boundary conditions and conserving mass. Thus, 
it overcomes the two principal shortcomings of the previous characteristic 
methods while maintaining their numerical advantages. We conducted nu- 
merical experiments to observe the performance of the ELLAM scheme with 
many widely used methods ; including the upwind EDM, Galerkin 

FEM, quadratic and cubic Petrov-Galerkin FEMs HHE], the streamline dif- 
fusion FEM jI3j, the continuous and discontinuous Galerkin FEMs [ 1 41 1 1 tij . 
the monotone upstream-centered scheme for conservation laws (MUSGL) and 
the minmod scheme 0IIH1. These experiments show that the ELLAM scheme 
outperforms these methods in the context of linear transport PDEs. We also 
proved optimal-order error estimates for the ELLAM schemes in mini. 

In this paper we present an ELLAM scheme and numerically compare 
the performance of the ELLAM scheme with the essentially non-oscillatory 
(ENO) and weighted ENO (WENO) schemes ITTlIT^ITni . 



2 An ELLAM Scheme 

2.1 Definition of Test Functions 

We consider a multi-dimensional linear advection-reaction PDE 

ct + V • (vc(x,t)) -I- AT(x,t)c = E(x, t), xefl, te(0,T], (2.1) 

where 12 C IR'^ is a bounded domain with a Lipschitz continuous boundary 
r = 512. A boundary condition 

c(x, t) = g(x, t), (x, t) € P(-^) X [0, T] (2.2) 

is specified only at the inflow boundary P^^^ identified by P^^^ = {x | x G 
r, V • n < 0}. In addition, an initial condition c(x,0) = cq(x) is specified to 
close Eq. (2.1). 

We define a quasi-uniform temporal partition on [0,T] by 0 = 2 q < G < 
t 2 < ■ ■ ■ < tAT-i <tN = T. Multiplying Eq. (2.1) by space-time test functions 
w(x, t) that are continuous and piecewise smooth, vanish outside the space- 
time strip 12 X 2„], and are discontinuous in time at time tn-i, we obtain 
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a space-time weak formulation 



c(x,tn)w(x,tn)dx 





■ n c(x, t)w{x, t)dsdt 



■ / / c{x,t){wt + 'v -Vw — Kw){x,t)dxdt 

J tn-1 J O 



(2.3) 



= / c(x,t„_i)w(x,t+_Jdx-|- 



F{x, t)w(x, t)dxdt, 



where w{x,t^_^) = limj_^j+ w{x,t), which takes into account the fact that 
w(x,t) is discontinuous in time at time tn-i- 

In the ELLAM framework 0|, the test functions w are chosen to satisfy 
the adjoint equation of Eq. (2.1) 



Wt -I- V • Vic — K w = 0. (2-4) 

Let y = r{9;x,t), with t G [fy-ifyn], be the characteristic determined by 

^=v(y,6<), with y|s=t = x. (2.5) 

Eq. (2.4) is rewritten as 

--^w{r{e-,x,t},0) + K{r{e]x,t),0)w{r{9;x,t},0) =0, ^ 2 . 6 ) 

w{r{0-x,t),9)\g^i =w{x,F). 

Solving Eq. (2.6) yields the following expression for w 

wiri9; X, t), 9) = w{5c, t)e~ (2.7) 

Therefore, the test functions w in Eq. (2.3) should vary exponentially along 
the characteristics r(0; x,t). Once w{x, t) is specified, w{v{0] x,F),9) is deter- 
mined completely along the characteristic r{0;x,t). Thus, to define the test 
functions w in the space-time strip O x [fy_i,fy], we only need to define w 
on 12 at the time and on the space-time outflow boundary T^*^) x i„] 

with = {x G r I V • n > 0}. 

2.2 Derivation of a Reference Equation 

To avoid confusion, we replace the dummy variables x and t in the second 
term on the right-hand side of Eq. (2.3) by y and 9 and reserve x and t for 
the points in 12 at time 2„ or on T x [2„_i, 2„]. Let 12(0) C 12 be the set of the 
points that will flow out of the domain 12 during the time period [9,tn\. For 
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any y G there exists an x G H such that y = r(0;x,t„). Likewise, 

for any (y,0) G there exists a pair (x,t) G x such that 

y = r(0;x,t). Therefore, 

f"(y,^)w(y,0) dyde 



' tfx — 1 ^ 



Jtn-1 Jn\VL{0) 

ptn P 



F{r{9;x,tn),d) w{r{9;x,tn),d) drd9 
F{r{9;x,t),9) w{r{9;x,t),9) dvd9. 



( 2 . 8 ) 



J tn-1 J n(0) 

The first term on the right-hand side of Eq. (2.8) is evaluated by applying 
the Euler formula at time t„, leading to 

F{v{9- X, tn),9)w{v{9] X, t„), 9)drd9 



/t„_i Jn\Q.(0) 

ft. 



Q. J t* (x) 

= / F{x,tn)w{x,tn) 
JQ 



F{r{9; x, t„), 6»)w(r(6»; x, t„),9) \ | d6»dx 



dx 



e-K(^,tAW-e)^0 



/f(x) 



(2.9) 



dx + Ei{f, w) 



= [ (j)^^^X,tn)F{x,tn)w{x,tn)dx + Ei{F,w). 
Jn 



Here the space-dependent time step At^^)(x) = t„— t*(x), where t*(x) = t„_i 
if the characteristic r(6*;x,t„) does not backtrack to the boundary T during 
the time period or t*(x) G [tn-i,tn] is the time when r(0;x,t„) 

intersects the boundary T otherwise. 

= (1 - ^(^))/iL(x,t„), 



if K{x,tn) yf 0, or Af(^i(x) otherwise. Ei{F,w) is the local truncation error. 

The second term on the right-hand side of Eq. (2.8) is treated similarly. 
We obtain 

rtn r 



/ / F{r{9;x,t),9) w{r{9;x,t),9) drd9 

Jtn-i JQ{e) 



/i„_i Jr(o) 



n^(2)(x,t)E(x, t)w(x, t)dsdt + E 2 {F, w). 



(2.10) 



Here Afi'^i(x,t) = — t*{x,t) for (x,t) G T^'^i x where t*{x,t) = 

tn-i if r(0; X, t) does not backtrack to the boundary T during the time period 
or t*{x,t) G [tn-i,t] is the time when r{9;x,t) intersects the bound- 
ary r otherwise. <j)^^\x,t) = (1 - / K{x,t) if K{xA) 0 

or Afi‘^i(x,t) otherwise. E 2 {F,w) is the local truncation error. 
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Incorporating Eqs. (2.8)-(2.10) and the inflow boundary condition (2.2) 
into Eq. (2.3), we obtain the following reference equation 
r rtn r 



c(x,f„)rt;(x,t„)dx + 



/t„_i Jr(o) 



f (x, t) ■ n(x)c(x, t)?ii(x, t)dsdt 



= / c(x,t„_i)w(x,t+_Jdx + / (j)^'^\x,tn)F{x,tn)w{x,tn)dy 

Jn . Jn 



/t„_i Jr<.o) 
rtn n 



Jnn 



c(|(2)(x,t)v(x,t) • n(x)F(x,t) w(x, t) dsdt 
v(x, t) ■ n(x) g{x, t)w{x, t)dsdt + E{w), 



( 2 . 11 ) 



where ^(w) = J*" ^ J^c{x,t)[wt{x,t)+-v[x,t)-'^w{x,t) — K{x,t)w{x,t)]dxdt 
+ £’i(F, w) + i? 2 (E, w). 



2.3 A Numerical Scheme 



In the ELLAM scheme, the trial space Sh consists of piecewise linear (or 
d-linear) functions in fl at time and on L^*^) x Because of 

the boundary condition (2.2), no degrees of freedom should be introduced 
at at time Similarly, since the solutions are known at the time 
step tn-i, no degrees of freedom should be introduced at at time t„_i. 
However, to conserve mass, all the test functions should sum exactly to one 
0. Therefore, the test space Sh is obtained by modifying the trial space Sh- 
For a basis function associated with a node at L^-^^ at time we add it to the 
basis function associated with its adjacent node that is inside fl. The basis 
functions associated with the nodes at at time tn-i are treated similarly. 
An ELLAM scheme is formulated as follows: And c(x, t) G Sh which satisfies 
the boundary condition (2.2), such that \/w{x,t) G Sh 



c{x,tn)w(x,tn)dx + 



hr,-! Jr(0) 



v(x, t) ■ n(x)c(x, t)w{x, t)dsdt 



= / c(x,t„_i)u>(x,t+_i)dx + / <i)^'^\x,tn)F{x,tn)w{xAn)dy- 

Jn . Jn 



t„_i Jr(o) 
tn 



tr^-1 Jru) 



())^^^(x, t)v(x,t) • n(x)F(x,t) w(x,t) dsdt 
r{x,t) ■ n(x) g{x,t)w{x,t)dsdt. 



( 2 . 12 ) 



By using a Lagrangian coordinate, the ELLAM scheme (2.12) significantly 
reduces the temporal truncation errors and generates accurate solutions even 
if very large time steps are used. Moreover, it symmetrizes the governing 
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PDE (2.1), and generates a well-conditioned, symmetric and positive-definite 
coefficient matrix. Thus, the discrete system can be solved efficiently by, for 
example, the conjugate gradient method in an optimal order without any 
preconditioning needed. Furthermore, this scheme naturally incorporates the 
boundary condition (2.2) into its formulation and conserves mass. Finally, 
most terms in the scheme are standard in the FFM and can be solved in a 
straightforward manner. In the first term on the right-hand side, the trial 
and test functions are actually defined at different time steps. Hence, its 
evaluation could be very challenging and raises serious numerical difficulties. 
We use a forward Fuler or second-order Runge-Kutta tracking algorithm HH 
to evaluate this term. 

3 Description of ENO and WENO Schemes 

The FNO scheme was first introduced by Harten et al mum as an impro- 
vement over traditional fixed-stencil high-order FD interpolations, which are 
known to be oscillatory in nature especially near discontinuities of the soluti- 
ons. These oscillations do not decay as the mesh is refined and lead to further 
instabilities in the solution. By choosing the smoothest stencil among several 
candidates, FNO provides a uniformly high order approximation of the fluxes 
at cell boundaries while avoiding the spurious oscillations near steep fronts 
and shock discontinuities that are associated with traditional FDMs. As a 
further improvement, Liu et al mi provide Weighted FNO (WFNO) that is 
based on using a convex combination of the candidate stencils instead of just 
one. This treatment maintains the advantages of the original ENO and in 
addition obtains a higher-order accuracy. 

In this section we describe the ENO and WENO schemes of order k = 4 
for Eq. (2.1) defined on with a uniform partition Xi = iAx and yj = jAy. 
If we assume that no reaction or source term is present and let v = (Vi, V2)) 
Eq. (2.1) can be rewritten as 

ct + fx(c) + gy(c) = 0, with /(c) = lAc, g{c) = V 2 C, (3.1) 
We can write the spatially discretized ENO and WENO schemes as 

- fi-i/ 2 ,j) - -^{gi, 3 + 1/2 - k, 3 - 1 / 2 )- (3-2) 

Below we define the numerical flux fi+ 1 / 2,3 for a fixed /, and will skip 
the subscript j when it is clear from the context. The flux k+ 1 / 2,3 in the y- 
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direction is defined by symmetry. For the fixed j, we identify the cell averages 
Qi = Upwinding is guaranteed by computing the Roe speed given by 



O-i+112 — 



/(cj+l,j) /(cjj) 

Cij 



(3.3) 



The flux fi+i/ 2 j is then given by 



fi+l/2,j 



O'i+1/2 ^ 0) 
'lJ- 1 / 2 ’ otherwise. 



(3.4) 



The ENO and WENO schemes differ in the reconstruction procedure that 
provides '^i+i/ 2 - ENO scheme starts by computing the undi- 

vided differences for degrees 1 to fc = 4 

Q[a^i-l/2; a^i+ 1 / 2 ] = 9i) 

Q[^i — l/2^ • * • j ^z-t-j + 1/2] Q[^i+l/2j ■ • ■ 1 ^i+j-l-l/2] (^'^) 

Q\r^i—\j2 ' 1 ^i+j— 1/2] ■ 

The selection of the smoothest 5-point stencil (corresponding to k = A) starts 
with the two point stencil 



^={ 



Xi- 1 / 2 , Xi+ij 2 



}• 



(3.6) 



If 

\Q[xi-s/2iXi-i/2,Xi+i/2] \ < |Q[a;i-i/2, a;j+i/2, Xi+ 3 / 2 ] I, (3-7) 

then Xj_ 3/2 is added to the stencil S. Otherwise, Xj_|_ 3/2 is added. We repea- 
tedly add points in a similar manner until we come up with a 5-point stencil. 
Assume that the stencil S is composed of r cells to the left and s cells to the 
right of the current cell. We introduce 



3 

'^i-l-l/2 “ ^ ^t-1/2 

1=0 



3 

^ ) O— 
1=0 



(3.8) 



where 

_ ( J2l=0,l^m ^p=o,pyrm,i{x ~ P + 1) \ 

I nto,^^(m-/) ) ■ 

The WENO reconstruction is based on a convex combination of four dif- 
ferent reconstruction values 



3 3 

— o + ~ o 

^^+l/2 = Z^^-9,+ 1/2, Qi-1/2 = 

r—0 r—0 



(3.10) 
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(r) (r) 

where the reconstructed values '^ 1 - 1/2 defined 

by (3.8). The weights are given by 



a^ 

= =3 5 

^s=0 
~ Ctf' 

^ ’ 

^s=0 



Ck.'p 

Qly’ 



dr 

{e + (3r) 
ds-r 

{e + (dr) 



0 < r < 3. 



(3.11) 



Here the constants are do = 1/35, di = 12/35, d 2 = 18/35, ds = 4/35, and 
e = 10“®. The smoothness indicators are 



/^r — |^9i+3— r ^Qi+2—r H“ ^Qi+l—r Qi—r) 

|"(9i+2— r ^Qi+l — r d~ Qi—r) T (^i+S—r ‘^Qi+2—r d~ (^i-\-l—r) 

I (^i+l — r Qi—r) d~ ((7z+2— r Qi+l — r) d~ (<7i+3— r Qi+2 — r) 



1 

+ 2L 
1 

+ 3L 



(3.12) 



For n = 1,2,...,A^, a fully discrete ENO or WENO scheme with a fourth- 
order Runge-Kutta temporal discretization is defined by 



= c„_i -h ^L(c„_i), = c„_i -h ^L(c^^)), 

= c„_i 

At 



AtL(c(2)), 



(3.13) 



Cfi — 2 ' 



(-c„_i + c(i) + cu(2) + c(3)) 4- ^L(c(3)). 



4 Numerical Experiments 

The spatial domain is H = (—0.5, 0.5) x (—0.5, 0.5), a rotating velocity field is 
imposed as v(x) = v{x,y) = {—Ay, Ax)- The time interval is [0,T] = [0,7t/2], 
which is the time period required for one complete rotation. The initial 
condition cq(x) is given by 

co(x) = exp , (4.1) 

where Xc = {xc,yc), and a are the centered and standard deviations. The 
corresponding analytical solution for Eq. (2.1) with / = 0 is 

c(x, t) = exp - J K{r{9;x,0),9)d9^ , (4.2) 

wherex = (x,y) = (a; cos(4t)-|-?/sin(4t), — xsin(4t)-|-a; cos(4t)), and r(6*; x, 0) = 
(a;cos(40) — y sin (40), a; sin (40) -|- y cos(40)). 
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This example can be viewed as an incompressible flow in a two-dimensional 
homogeneous medium with a known analytical solution, and has been widely 
used to test for numerical artifacts of different schemes, such as numerical 
stability, numerical dispersion, spurious oscillations, deformation, and phase 
errors, etc. For the performance of the ELLAM scheme for problems with 
discontinuities, please see iHEnj. 



Scheme 


h 


At 


Max. 


Min. 


CPU 


CFL # 


Fig. # 


anal 


1/64 


N/A 


N/A 


1 


0 


N/A 




ELLAM 


1/64 


7t/8 


0.9987 


0 


Im 5s 


71.25 


1 


ENO 


1/64 


7t/580 


0.8757 


0 


Im 10s 


0.98 






1/64 


7T/6000 


0.8756 


0 


12m 5s 


0.09 






1/96 


7t/860 


0.9552 


-0.0004 


3m 45s 


0.99 






1/96 


7T/4000 


0.9552 


0 


17m 26s 


0.21 






1/196 


7T/1700 


CK3 


CX3 


30m 2s 


1.0 






1/196 


7T/2400 


0.9443 


-0.0040 


42m 7s 


0.71 






1/256 


7T/2400 


0.9967 


-0.0311 


Ih 15m 


0.95 






1/256 


tt/SOOO 


0.9836 


-0.0013 


Ih 34m 


0.76 


2 




1/512 


7T/6000 


0.9993 


0 


12h 52m 


0.76 




WENO 


1/64 


7t/580 


0.9617 


0 


2m 17s 


0.98 






1/64 


7T/6000 


0.9617 


0 


23m 37s 


0.09 






1/96 


7t/860 


0.9832 


0 


7m 34s 


0.99 






1/96 


7T/4000 


0.9832 


0 


34m 57s 


0.21 






1/196 


7T/1700 


0.9964 


0 


Ih Im 


1.0 






1/196 


7T/2400 


0.9964 


0 


Ih 27m 


0.71 






1/256 


7T/2400 


0.9979 


0 


2h 31m 


0.95 


3 




1/256 


tt/SOOO 


0.9979 


0 


3h 9m 


0.76 






1/512 


7T/6000 


0.9995 


0 


Iday Ih 


0.76 





Table 1. The Performance of the ELLAM, ENO, and WENO. 

In the example runs, the data are chosen as follows: K = f = 0, Xc = 
(—0.25,0), cr = 0.0447. A uniform spatial grid h = Ax = Ay = ^ and a 
time step of At = tt/8 are used in the ELLAM simulation. A second-order 
Runge-Kutta method with a micro-time step of Atm = At/ 20 is used to 
track the characteristics. For the given h, a largest admissible (satisfying 
the CFL condition) time step of At = ^ is used in the ENO and WENO 
simulations. We then systematically reduce the sizes of h and At to examine 
the performance of ENO and WENO schemes until their solutions are roughly 
compatible with the ELLAM solution. The numerical results are presented 
in Table 1 with selected solutions are plotted in Figures 1-3. 
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5 Summary 

We present an ELLAM scheme for linear advection-reaction PDEs in mul- 
tiple space dimensions and carry out numerical experiments to compare the 
performance of the ELLAM scheme with ENO and WENO schemes. We have 
the following observations: (i) Even though very large time steps and coarse 
spatial grids are used, the ELLAM scheme still generates much more accurate 
solutions than those obtained with the ENO and WENO schemes with much 
finer spatial and temporal grid sizes. Hence, the ELLAM scheme outperforms 
ENO and WENO schemes in the context of linear transport PDEs. (ii) The 
mathematical models used to describe multiphase and/or multicomponent 
fluid flow processes in porous medium are typically strongly coupled systems 
of nonlinear PDEs jSJElEni- Nevertheless, most of these PDEs govern the 
transport of different components in these fluids and are advection-dominated 
PDEs that are linear in terms of the concerned concentrations. Thus, after 
some decoupling/linearization techniques are applied, these transport PDEs 
are reduced to linear advection-dominated transport PDEs at each time step. 
Therefore, ELLAM schemes are expected to perform very well for porous me- 
dium flow problems, which has in fact been confirmed in uni. ( iii) In this 
paper the ELLAM scheme is compared with the ENO/WENO schemes in 
terms of linear transport PDEs. The latter were originally developed for 
nonlinear hyperbolic conservation laws and have been successful in resolving 
shock discontinuities and complex solution structures. The investigation on 
the performance of the methods for nonlinear hyperbolic conservation laws 
are beyond the scope of this paper. 
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Abstract 

An Eulerian-Lagrangian localized adjoint method (ELLAM) is pre- 
sented for compressible flow occurring in compressible porous media 
with wells. The ELLAM scheme symmetrizes the governing transport 
equation, greatly eliminates non-physical oscillation and/or excessive 
numerical dispersion present in many large-scale simulators widely used 
in industrial applications, and conserves mass. Computational experi- 
ments show that the ELLAM scheme can accurately simulate incom- 
pressible and compressible fluid flows in porous media with wells, even 
though coarse spatial grids and very large time steps, which are one or 
two orders of magnitude larger than those used in many numerical me- 
thods, are used. The ELLAM scheme can treat large mobility ratios, 
discontinuous permeabilities and porosities, anisotropic dispersion in 
tensor form, and wells. 



KEYWORDS: characteristic method, compressible flow, Eulerian-Lagrangian 
methods, porous medium flow, wells 

1 A Mathematical Model 

The objective of subsurface fluid flow modeling is to simulate complex fluid 
flow processes occurring in subsurface porous media sufflciently well to op- 
timize the recovery of hydrocarbon or to accurately predict and thoroughly 
remediate the contamination in groundwater transport processes. In order to 
do this, one must build mathematical models to describe the essential pheno- 
mena and the fundamental laws, and design numerical methods to discretize 
these models and to represent the basic features as well as possible without 
introducing serious nonphysical phenomena. 

Let p(x, t) and u(x, t) be the pressure and the Darcy velocity of a fluid 
mixture, and c(x, t) be the concentration of an invading fluid or concerned 
solute/solvent in the fluid mixture. The equation of mass conservation for 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 324-^^^ 2000. 

(c) Springer-Verlag Berlin Heidelberg 2000 
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the fluid mixture and Darcy’s law lead to the following coupled system of 
partial differential equations (PDEs) that describes fluid flow processes in a 
porous medium reservoir with injection and production wells moil 

( 1 . 1 ) 



In many cases, the thickness of the medium is significantly smaller than its 
length and width. Hence, it is reasonable to average the medium properties 
vertically and to assume H C with a nonuniform local elevation. K(x) is 
the permeability tensor of the medium, /r(c) is the concentration-dependent 
viscosity of the fluid mixture, which is determined by some mixing rule 

fr{c) = - c) + Mic]~‘^ (1.2) 

where is the viscosity of the resident fluid and M is the mobility ratio. 
p is the density of the fluid mixture, g is the magnitude of gravitational 
acceleration, d(x) is the reservoir depth, g(x, t) is a source and sink term that 
accounts for the effect of injection and production wells, (f> is the porosity of 
the medium (proportion of volume available to porous medium flows). 

For a fluid of constant compressibility Cp, the following equation of state 

p= Prexp{cp{p-pr)) (1.3) 

holds, where pr is the reference density at the reference pressure Pr- Eq. (1.3) 
and its simplified versions have been widely used in modeling subsurface 
contaminant transport and remediation in the hydro-science community. It 
can also be applied to compressible fluid flow processes in reservoir simulation, 
unless the fluids contain large quantities of dissolved gas 00 . 

Due to the effect of large pressure changes involved in porous medium 
fluid flow processes and the type of the medium of the reservoir, the porous 
medium can deform. Let c^(x) be the compressibility of the medium. The 
following constitutive relation is often used to model the porosity (j) 0 

cj)= cj)r{x)exp{c^{x){p- Pr)), (1.4) 

where (j)r is the porosity of the medium at the reference pressure Pr- 

Incorporating Eqs. (1.3) and (1.4) into the system (1.1), and introducing 
the mass flow rate cr = pu as a primary variable, we obtain the following 



— (#)-kV-(pu) =q, 



K 

p(c) 



{Vp- pgVd), 



xefl, te(o,T] 
X e fi, t e (o,T] 
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system of PDEs for the pressure p and the mass flow rate cr 
dv 

<S'p(x,p)^ + V • cr X e ff, t e (0,T], 

= 

p{c) 

where Sp(pc,p) is the storage term defined by 

Sp{x,p) = = p(/)(x,p)(c0(x) + Cp). (1.6) 

The equation of mass conservation for the concerned component can be 
expressed in terms of the mass flow rate cr as 

+ V • (CTC- D(cr,p)Vc) = c*g, x € t G (0,T], (1.7) 

ot 

where c* is a prescribed concentration at sources or is equal to c at sinks, 
D(cr,p) is the diffusion-dispersion tensor that consists of molecular diffusion 
and (anisotropic velocity-dependent) mechanical dispersion 

T>{(T,p) = dm(l>pl + dt\(T\l+ ^'' (1-8) 

|cr| V ) 

cr = (ca;, <Jy), dm is the molecular diffusion coefficient, I is the identity tensor, 
and dt and di are the transverse and longitudinal dispersivities, respectively. 

System (1.5) and (1.7) needs to be closed by the initial and boundary 
conditions. In petroleum reservoir simulation the boundary dLl is typically 
impermeable, leading to no-flow boundary conditions of the form Hi 

cr • n = 0, (D(cr,p)Vc) • n = 0, (x, t) £ (911 x [0, T]. (1.9) 

These conditions also arise in environmental modeling although other types 
of boundary conditions are possible 0. For simplicity, we assume boundary 
conditions (1.9) and a rectangular domain fl = x (ay,by) 00. 

If the fluid and the medium are incompressible, p = constant and (j) = 
4>{'x.). The system (1.5) and (1.7) is reduced to the mathematical model for 
incompressible fluid flow, which has been widely used previously 000. 

Because diffusion or dispersion is often a small phenomenon relative to 
advection, Eq. (1.7) is an advection-diffusion equation with advection being 
the dominant phenomenon. Additional features of (1.5) and (1.7) include 
the singularities of the solutions at wells, discontinuous permeabilities and 
porosities, a large adverse mobility ratio in the flow processes that could 
cause viscous Angering phenomena, anisotropic dispersion in tensor form, as 
well as the enormous size of field-scale applications. 



{Vp-pgVd), X G fl, t G (0,T], 
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2 An ELLAM Scheme 



We use a sequential decoupling and linearization technique for system (1.5) 
and (1.7), and a mixed finite element method to solve p and cr from (1.5) 
0IO|. For simplicity, we describe only an ELLAM scheme for Eq. (1.7) 
assuming that the pressure p and the mass flow rate cr in (1.7) are known. 

The ELLAM was originally introduced by Celia, Russell, Herrera, and 
Ewing for the solution of (one-dimensional constant-coefficient) advection- 
diffusion PDEs 13 0. Let 0 = to < < • • • < < ■ . . < tN-i < In = T 

be a partition of the time interval [0,T] with At„ = tn — tn-i- In the 
ELLAM formulation, we multiply (1.7) by space-time test functions w that 
are continuous and piecewise smooth, vanish outside the space-time strip 
12 X (tn-i,tn\, and are discontinuous in time at time t„_i. This yields a 
space-time weak formulation 



(l>pc(x, tn)w{x, tn)dx + 



/ tn — i J 






(j)p 



tn — 1 ^ 

d wjy,9) 

de 



Vu;(y, 9) ■ D(cr,p)Vc(y, 9)dyd9 



Vu>(y,6») 



dyd9 



( 2 . 1 ) 



= / (j)p c{x,tn-l)w{x,t:^_^)dx + 



' tn — \ ^ ^ 



{c*qw){y,9)dyd9. 



Here w{x,C^ ,) = lim w(x,t) to take into the fact that w(x,t) is 
discontinuous in time at time tn-i- 

Careful analysis of various operator splittings in the ELLAM framework 
concludes that the test functions w{y, 9) in (2.1) should be chosen to satisfy 
the hyperbolic part of the adjoint equation of (1.7) (e.g., see P|) 

0P -^(y,^') + cr- Vw(y,6>) =0, yell, 6» e 2„]. (2.2) 

Equation (2.2) implies that the test functions w{y, 9) should be constant 
along the characteristics y = r(0;x,t„), defined by the differential equation 



dr 

d9 



r(6»;x,i) 




9 € [tn— 1 7 tn] , 



(2.3) 



In the ELLAM scheme, we choose the test functions w{x, tn) to be piecewise- 
bilinear functions for x £ 12 at time tn and define them by constant extension 
along the characteristics r(0;x,t„) to the space-time strip 12 x (tn-i,tn]. 
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We enforce the Euler quadrature at time to evaluate the source and 
sink term in Eq. (2.1). Note that for any {y,9) G Lt x [tn-i,tn], there exists 
an X G n such that y = r(0; x, t„). We obtain 



tn — 1 ^ ^ 



c*{y,9)q{y,0)w{y,0) dyd0 



c*(r(0; X, tn),0)q{r{0-, x, <„), 6»)w(x, t„) 

- 1 

= Atn / C*{x,tn)q{x,tn)w{x,tn)dx+ Eq{c* ,W), 

Jn 



d{r,0) 

i9(x,t„) 



d0dy (2.4) 



d{r,0) 



is the local truncation error. 



= l+O{tn—0) is the Jacobian of the transformation, Eq{c*,w) 



We can evaluate the diffusion-dispersion term similarly and obtain 



/ t-n — i J Q 

= At, 



Vw(y,6») • D(cr,p)Vc(y,6») dyd0 



/ Vw{x,tn) ■ B{(T,p){x,tn)yc{x,tn)dx+ Ei 3 {c,w), 

Jn 



(2.5) 



where E-d(c,w) is the local truncation error term. 

Using Eq. (2.2), we see that the last term on the left-hand side of Eq. (2.1) 
vanishes along the characteristics r(0;x,f„) defined by Eq. (2.3). However, 
for a general mass flow rate field cr(x,f), porosity 4>{x,p) and density p{p), 
one cannot analytically solve the problem (2.3) to track r(0;x,t„) exactly. 
Hence, numerical means have to be used to approximate the characteristics. 
For linear transport PDFs where the fluid velocity is assumed to be a known 
smooth function, we were able to utilize an Euler or a Runge-Kutta quadra- 
ture to track the characteristics and to obtain accurate numerical solutions 
HD. For the coupled system (1.5) and (1.7), a semi-analytical tracking algo- 
rithm should be used [D- When a numerical method is used to approximate 
the characteristics r(0; x, t„), the last term on the left-hand side of Eq. (2.1) 
does not vanish. Nevertheless, it has been proven that dropping this term 
does not affect the optimal order convergence rate of the ELLAM scheme 

HD). 

In the ELLAM scheme, we substitute (2.4) and (2.5) into Eq. (2.1) and 
drop Eq{c*,w), E-£){c, w), and the last term on the left-hand side of Eq. (2.1). 
We define the trial functions c(x, t„) to be piecewise bilinear functions on H 
at time step as in the standard finite element method. Note that the trial 
functions coincide with the test functions on Ll at time level But the 
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trial functions c are defined at time step tn only while the test functions w 
are defined on the space-time strip x (tn_i, by constant extension along 
characteristics from at time . This leads to the following ELLAM scheme 



By using a characteristic tracking, the ELLAM scheme (2.6) significantly 
reduces the temporal truncation errors and generates accurate numerical so- 
lutions even if very large time steps are used. Moreover, the ELLAM scheme 
conserves mass [3|, which is of essential importance in applications. Further- 
more, the ELLAM scheme symmetrizes the transport PDE (1.7) and gene- 
rates a 9-banded, symmetric and positive definite coefficient matrix. Finally, 
except for the first term on the right-hand side, all other terms in (2.6) are 
standard in the finite element method and can be computed in a straightfor- 
ward manner. In this term, the trial and test functions are actually defined 
at different time steps. Hence, the evaluation of this term is very challenging 
and raises various numerical difficulties . We refer interested readers to Cl 
for detailed implementational issues. 

3 Numerical Experiments 

In this section we apply the ELLAM scheme (2.6) to solve the system (1.5) 
and (1.7). The test problems are standard five spot pattern displacements 
in reservoir simulation. Example 1 is the simulation of an incompressible 
fluid flow in a rigid homogeneous medium, while Example 2 is the simulation 
of a compressible fluid flow in a compressible, heterogeneous medium. The 
(common) data in these experiments is as follows: The spatial domain 12 = 
(0,1000) X (0,1000) ft^, the viscosity of oil p,o = 1.0 cp, the mobility ratio 
M = 41. The injection well is located at the upper-right corner (1000, 1000) 
with an injection rate of g = 30 ft^/day and c* = 1.0. The production well 
is located at the bottom-left corner (0,0) with a production rate of g = 30 
ft^/day. A spatial grid of Ax = Ay = 50 ft, and a time step of At = 360 
days (one year) are used. Previously, the time steps used in these simulations 




( 2 . 6 ) 



/ <(>(x,p(x, t„_i))p(p(x, 2„_i))c(x, 2„_i)w(x, 2+_i)dx 
Jn 

+Atn / C*{x,tn)q{^,tn)w{x,tn)dx. 
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(a) Contour plot at t = 5 years. 



(b) Contour plot at t = 10 years. 



Figure 1: The concentration plots of invading component in Example 1. 



range from a few days for the upwind finite difference method (UFDM) to 
one month for the modified method of characteristics (MMOC) O I0| ■ 

Example 1: Incompressible flow in a rigid homogeneous medium. 

In this example run, we choose the remaining data as follows: The porosity 
(j) = 0.1, the permeability coefficients (diagonal entries) are = ky = 80 md, 
the molecular diffusion is Dm = (t>dm = 0, the longitudinal and transverse 
dispersions are Di = (j)di = 4.0 ft and Dt = (j)dt = 0.4 ft, respectively. The 
initial concentration is co{x,y) = 0. The contour plots for the concentration 
of the invading fluid at 5 and 10 years are presented in Fig. 1(a) and 1(b). 

Example 2: Compressible flow in a compressible, heterogeneous 

medium. In this example run, the compressibility of the medium = 
0.000001, the reference density pr = 0.8 g/cm^ = 49.942 Ib/ft^, the com- 
pressibility of the fluid Cp = 0.0001, the reference pressure Pr- = 1 atm. = 
14.696 psia, the initial pressure po{x,y) = 3000 psia, and the initial concen- 
tration co{x,y) = 0.0. The subdomain = (150,600) x (150,600) ft^. 
The remaining parameters on this subdomain are as follows: The reference 
porosity (pj. = 0.09, the permeability coefficients k^ = ky = 35 md, the mole- 
cular diffusion coefficient Dm = 0.0 ft^/day, the longitudinal and transverse 
dispersions are Di = 3.6 ft and Dt = 0.36 ft, respectively. On the subdomain 
the reference porosity (pr = 0.1, the permeability coefficients 
kx = ky = 80 md, the molecular diffusion coefficient Dm = 0.0 ft^/day, the 
longitudinal and transverse dispersions are Di = 4 it and Dt = 0.4 ft, res- 
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(a) Contour plot at t = 5 years. (b) Contour plot at t = 10 years. 

Figure 2: The concentration plots of invading component in Example 2. 



pectively. The contour plots for the concentration of the invading fluid at 5 
and 10 years are presented in Fig. 2(a) and 2(b). 

These results show that the ELLAM scheme can accurately simulate in- 
compressible and compressible fluid flows in porous media with wells, even 
though coarse spatial grids and very large time steps, which are one or two 
orders of magnitude larger than those used in many numerical methods, are 
used, implying a significantly improved efficiency and accuracy. The ELLAM 
scheme can treat large mobility ratios, discontinuous permeabilities and poro- 
sities, anisotropic dispersion in tensor form, and wells. The ELLAM scheme 
conserves mass. 
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Abstract 



Based on Tikhonov’s regularization method, this paper applies two 
fast convergent algorithms developed by the authors to solving 2D 
integral equations of the first kind. The procedures of discretization 
and regularization are discussed. The numerical tests are presented to 
show high efficiency and numerical stability. The integral equations 
of the first kind can be seen in determination of capillary pressure 
functions. 

KEYWORDS: regularization method, fast convergent algorithm, 2D integral 
equation of the first kind 

1 Introduction 

In many application areas, such as determination of capillary pressure func- 
tions in porous media, signal reconstruction, image restoration, and operator 
identification ^ El 001, we encounter integral equaitons of the first kind. As 
a genaral framework of dealing with inverse problems, equations of this kind 
are inherently ill-posed and solving them is time consuming. Thus some kind 
of regularization methods must be employed. So far, many efforts have been 
put to the ID case El s SI HOI mu uni, while few successful results have 
been obtained for the 2D case. The main difficulty lies in the huge amount 
of computation time in the iterative process of choosing an appropriate re- 
gularization parameter. This results in need for the study of fast convergent 
algorithms. 

Let us consider the general 2D integral equation of the first kind 




K{x, y, C, r])d^dr] = u{x, y), {x, y) G [02, 62] x [c2, ^2]- 

( 1 . 1 ) 



Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 333-^^^ 2000. 
(c) Springer-Verlag Berlin Heidelberg 2000 
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The kernal K{-,-) is continuous and A : W 2 [ai,bi; di] — ?► T^[u 2 , 62 ; C 2 , ^ 2 ] 
is a bounded linear and compact operator. As known, when the range of A, 
i.e., R{A), is not closed, (1.1) is ill-posed. 

Generally speaking, the right-side member u and the operator A of (1.1) 
are roughly known. Suppose that the approximate version of the input data 
(A,u) is {Ah, Us), with the error 

\\Ah-A\\<h, ||it - wall < (5, h, <5 > 0. ( 1 . 2 ) 

Due to the ill-posedness of this problem, we cannot take given by 

Zn = {Ah)^us, rj = {h,S) (1.3) 

as the stable approximation to the exact solution of the problem ( 1 . 1 ): 

zt = A~^u, (1.4) 

where A+ and denote the generalized inverses of the related operators. 
To get a stable approximation of the generalized solution of A~^us (or A'l^us), 
we employ Tikhonov’s regularization method, which consists of the steps 

1. Constructing the regularized operator R{u,a); 

2. Selecting the regularization parameter a = o:((5) (or a = a{S, h)) 
matching the error ry = {S, h) of the input data. 

The regularized solution R{us,a) can be obtained by minimizing the 
smoothing functional 

M°‘[z,us] = \\Az- us\\^ + aO\z\, a > 0, (1.5) 

or 

M°"[z,us] = \\AhZ - usW^ + an[z], a > 0. (1.6) 

Obviously, the minimizer zf or must satisfy the Euler equation 

{A*A + aVL')z = A*us, (1.7) 

or 

{Ah*Ah + an')z = Ah*us. ( 1 . 8 ) 

As a posterior strategy, the regularization parameter should satisfy the so- 
called discrepancy equation pain) 



<l,{a) = \\Azt-us\\^-5^ = Q, 



(1.9) 
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or the generalized discrepancy equation m 



(j){a) = \\AhZ^ 



usf-{5 + h\\z'^\\r = 0 



( 1 . 10 ) 



Here (or A*) is the Hilbert-adjoint operator of Ah (or A), n[-] denotes 
the identity operator or the positively defined, linearly differentiable operator, 
and n'[-] denotes the Frechet derivative. 

Denote the root of (1.9) or (1.10) as a = o;((5) or a = 0(77). Then the 
regularized solution R{us,a{S)) or R{us,a{r])) is the stable approximation 
to the exact solution zt- 

The paper is organized as follows. §2 discusses the discretization and 
regularization of (1.1). §3 presents two fast convergent algorithms for solving 
(1.9) or (1.10) and gives the related convergence theorem of the algorithms. 
The numerical tests are presented in §4 to compare the efficiency of the new 
algorithms with Newton’s method. 

2 Discretization and Regularization 

We use a finite-difference method to discretize (1.1). Let the step sizes in the 
direction of 77, x, and y be hi, ^2, h^, and ^4, respectively. The difference 
scheme of (1.1) is 






( 2 . 1 ) 



where 




K, J, L, and M are given positive integers, and 
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Define the norms || • and || • by 

\\^hih2\\2 — ( .yhih2 , , 

Ik \\w^i^2 ~ )W^1^2^ 

K J K J 

= X) I] hih2Zkjfkj + X X hih2AzkjAfkj, 

k—lj=l k—lj—1 

Ay, ^k + l,j~^kj I ^k,j + l~^kj 

— hi h2 ^ 

k = l,2,...,K, j = l,2,...,J, ZkjiTkjeWAA^ 

11X3^4 = 

L M 

= X X h^hiUlinVlmi V^^4 g ih^h^ ^ 

1 — 1 m—1 

where f^j and vi„i are the conjugates of r^j and vim, respectively. 

We can show that is the difference expression of ||.z(^, ?7) ||^i 

given by 

M^,v)rw^^=J^ (k(?,^)|K|^ + ^PKdr7. 

Then the discrete regularized solution of (1.1) can be obtained by minimizing 
the discreted Tikhonov functional 



( 2 . 2 ) 

The following theorem is classical. 

Theorem 2.1 For any g ih^hi positive number a, there 

exists a unique element such that 



and satisfy the Euler equation 

{A*A + aB)z’^^^^ = A*u^A 4 ^ 
where A* is the adjoint matrix of A and B is in the form 



B = 



h^hi 



C -H 

-H C -H 

-H C -H 



-H C -H 

-H C 



(2.3) 



(2.4) 



J LxM 
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C = 



h^h. 



3'H 



hi 



2 (^ i +^2 I 9 

hlhl + ^ 
-1 
TA 



-1 

■2{hl+hl) 



-1 

7? 



-1 

TA 



2(/lj+/t2) I r> 

hlK + ^ 
-1 
hi 



-1 

■^(hl+h\) 



+ 2 



H = 



/j-O 



1 

7iT 



-■ KxJ 



^2 J KxJ 

For the proof, the first part is mainly based on the completeness of the 
space , while the remaining part is a common business of variational 

calculas. 

We can see that i? is a 5-diagonal matrix and is of the diagonal dominance, 
so B is non-singular. Thus (17) can be rewritten as 



{B~^A*A + = B~^A*u’^^'^C 



(2.5) 



It is more convenient to solve (2.5). By the way, (2.4) or (2.5) is a regulariza- 
tion of (2.1). Considering the huge amount of computation, solving it along 
with the discrepancy equation (1.9) or (1.10) is not easy. This urges us to 
find efficient methods to solve this problem. 



3 Fast Convergent Algorithms 

Without loss of generality, we only consider the method for solving equation 
(1.9). To do this, we first quote some results for the existence of the root and 
the convergence of the corresponding regularization solution . We define 

/3{a;us) = \\Azs -us\\, (3.1) 

the discrepancy or discrepancy function of the approximate solution z“ [71, 
where Zg is the solution of (1.7). 

The following theorems are useful in our further considerations. 

Theorem 3.1 Assume that u G R{A) (the range of A) and the signal-to- 
noise ratio of the input data us is ||u5||/i5 > 1, i.e., 



||u - us\\ <S< ||U 5 ||. 



(3.2) 
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Then the function a — >■ /3(a; ug) is continuous and non- decreasing, and 5 
belongs to the range of the function. 

From the above theorem, there exists a unique a* = a*((5) satisfying the 
discrepancy equation 

fi{a*{5)-us) = 5, (3.3) 

or 

4>{o*m=Q. (3.4) 



Theorem 3.2 Under the same conditions of Theorem 3.1, if the regulariza- 
tion parameter a = a(<5) is determined by (3.3) or (3.4), then we have 

lim = zt ■ (3.5) 

< 5->-0 

The proof of the above theorems is analogous to that of the theorems in 
|Z]. In addition, by differentiating the two sides of (1.7) with regard to a 
directly, we can easily obtain the following theorem. 

Theorem 3.3 For any regularization parameter a > 0, the solution Zg of 
(1.7), along with the function 4>{a), is infinitely differentiable with respect to 
a and we have 

{A* A + an'){z^y = -n'z2, (3.6) 

{A*A + an'){zyy^^ =-kn'{zyy^-^\ fc = 2,3,.... (3.7) 

In principle, there are two ways to improve the efficiency and decrease 
the amount of computation in dealing with the discrepancy equation (1.9). 

1. Try one’s best to improve the convergence rate while solving the equa- 
tion (1.9) by iterative methods. By making full use of the good property of 
the function </>(a) (it is infinitely differentiable about a) and the higher-order 
Taylor’s expansion, we may construct algorithms which possess a higher-order 
convergence rate. 

2. Associated with the above tactics, we should decrease the total amount 
of computation for evaluating the functions 4>{a), 4>'{p), and 4i'{q). This can 
be accomplished by Cholesky’s decomposition, along with three times of the 
back substitution. 

Let us state the iterative formulas of two algorithms. 

Given, by truncating the Taylor expantion of (j){a) after the (a — a„)^ 
term at a = a„, we obtain the two iterative formulas. 
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Iterative formula 1: 



Q^n+1 — 



2(/>(a«) 

+ sign{4>'{an))\l ~ 2(/>(a„)(/)"(a„) 



(3.8) 



Iterative formula 2: 



4 * (^n) / \2 

(()'(«„) 2(()'(a„) (/)'(«„) 



(3.9) 



Below we see that the iterative formula (3.8) or (3.9) possesses the pro- 
perty of a third-order convergence rate. It should be pointed out that (j>{a„), 
4>'{an), and (f>''{a„) can be computed as follows: 

fj r 

^'(a) = -2a{^,z^), (3.10) 

where dza/da, and d'^Za/do^ can be obtained by solving a series of equa- 
tions 

{A*A + akn')z^,= A*us, (3.12) 

{A A + auD, )z^^ = —fl Zctf., (3.13) 

{A*A + akfl')z':,^=-2fl'z'^^. (3.14) 

From (3.12)-(3.14) we can see that their differences only lie in the right- 
hand side. This hints that, in one cycle of the iterations, by using Cholesky’s 
decomposition only once with three times of back substitutions, the vectors 
Zq,^, z'^^, and z"^_ are obtained. In this way, the computation time is greatly 
saved. 

Assume that the operator equation of the first kind (1.1) has been discre- 
tized and regularized. Then the iterative scheme can be given as follows. 

Algorithm 1. 

step 1: input oq, d, e, A, and u, and set k = 0; 

step 2: Give a decomposition: A^A + akfl' = LkL}A\ 

step 3: Solve the equation: Lky = A^ug, z'^’‘ = y; 

step 4: Compute (p{ak), if \(p{ak)\ < e; then go to step 10; 

step 5: Solve the equation: L^y = —Zg'", L}A {z^A = y; 

step 6: Compute 4>'{ak), set dk = Newton step iteration 



O^k ■ — Otk dk) 



(3.15) 
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step 7: Solve the equation: Lky = —2{zg’‘y, Lk^ = y; 
step 8: Compute 4>"{ak) and set pk = 4>'{ak)'^ — 2(j){ak)4>" {ctk)] if Pk < 0, 
then go to step 9; else, proceed with the another iteration 



2(j){ak) 

ak ■= OLk r; 

4>'{ak) + sgn{(j)'{ak))pl 



(3.16) 



step 9: Set k\=k + 1 and go to step 2; 
step 10: Set a* = Ofc and z^* = z^*‘\ stop. 

Algorithm 2. 

steps 1-7, 9, and 10 are the same as in Algorithm 1; step 8 is changed as 
step 8: 

_ .2 /qivN 

ak-ak 

For the above two algorithms, we have the following convergence theorem. 



Theorem 3.4 Assume that 4>{a) has a root at a = a*: (f){oi*) = 0. Then 
3e > 0 such that, Voq G U{a*,e), the sequence {a/c}^i generated by Algo- 
rithms 1 and 2 converges at a third-order rate to a*. 

For the proof of the theorem, see m- 




Newton 

new1 

new2 







Figure 1: Convergent rate comparison and exact solution. 



Notice that after the decomposition is obtained, the amount of computing 
the fcth order derivative is 0{n). Thus, in solving equation Cx = b 

and computing 4>'{a), their total amount of computation is 

about n^/6. Thus, by employing our new algorithms, the convergence rate is 
greatly improved and the CPU time is also greatly saved. Moreover, if we let 
N denote the iterative steps of Newton’s method and M denote the iterative 
steps of the new algorithms, then as the order number n is sufficiently large. 
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the ratio of the amounts of computation (so is the CPU time) is near to 
N/M. 

For (1.10), our algorithms are also applicable. It needs only some modi- 
fications of the expression <p'(a) and <p"(a). 



4 Numerical Experiments and Conclusions 



To compare the efficiency, we implement numerical experiments. Let us de- 
note the Newton method as Newton and let newl and new2 represent Algo- 
rithm 1 and Algorithm 2 based on (3.8) and (3.9), respectively. The numerical 
experiments are done with MATLAB on an IBM-PC machine. 

Consider the integral equation (see ^) 

Az = f j exp{—^Q[{x — s — Q.f>Y + {y — t — Q.bY]}z{s,t)dadt = g{x,y), 



0 Jo 



(4.1) 

where the kernal is K{x,y; s,t) = exp{—%Q[{x — s — 0.5)^ + {y — t — 0.5)^]}, 
the interval is [0,1] x [0,1], and x,y G (0,2). The exact solution is 



z{s,t) = 



0.03 



a-0.7)^ 



0.955040800 



\ {t-0.3)“^ 

- 0.052130913 e — 






We use the compound trapezoid formula to discretize (4.1). Suppose that 
= {(s, t)|0 < s < 1, 0 < t < 1} and the related points are equally spaced 
points: 



Si = ihi, hi = 1/K, i = 0,l,---,K,tj = j/i2, h2 = l/J, j = 0, 1, . . . , J. 
Then a series of small rectangle are 

Dij — {(s,t)|sj ^ Sj+1, tj ^ 

where i = 0, 1, . . . , AT and j = 0, 1, • • • , J. 

Applying the trapezoid integral formula on each sub-rectangle, then the 
compound trapezoid formula of 2D is found 

L M 

EE AKjz{sutm) = g{xuym), (4.2) 

l—O m—0 



Akj 



1 

4ATJ 



K J 

EE PijKi^Xij ym'i 

2^0 j —0 



where 
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Figure 2. Computed solutions. 



and pij, i = 0, 1,2, . . . , K, j = 0,1,2, ..., J, are the elements of matrix P: 
P = {Pij)(K+l)y.{J+l), with 



■ 1 


2 


2 


... 2 


2 


1 ■ 


2 


4 


4 


... 4 


4 


2 


■ to 


4 


4 


... 4 


4 


2 


2 


4 


4 


... 4 


4 


2 


2 


4 


4 


... 4 


4 


2 


1 


2 


2 


... 2 


2 


1 



So the discrete matrix is 

(4.3) 

Due to the ill-posedness of (4.1), (Akj)lxm is ill-conditioned. Thus we 
should employ a regularization method to get its stable numerical solution. 
In addition, to compare the stability of the above three algorithms, we give 
a strong perturbation to the right-hand member of (4.3): 

where co = 500. Let S = 5.9161e — 6 and Oq = 0.001. The results are shown 
in Figs. 1 and 2 and Table 1. 

The left figure in Fig. 1 is the convergent rate comparison of three al- 
gorithms and illustrates that newl and new2 converge to the regularized 
solution more rapidly than Newton. Meanwhile newl is the fastest one. Un- 
der the case that K = J = L = M = 35, the concrete computed results of 
the three algorithms are shown in Table 1. 
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The numerical test also indicates that the three algorithms all converge 
well. They are not sensitive to the perturbation of the right-hand member. 
This can be seen from the right figure in Fig. 1 (the exact solution of the 
integral equation with disturbing right-hand members of newl method) and 
the left figure in Fig. 2 (the computed solution of the integral equation with 
disturbing right-hand members of newl method). The right figure in Fig. 2 
(the computed solution of the integral equation with disturbing right-hand 
members without using the regularization method) illustrates that without 
using the regularization method, the computed solution cannot be accepted. 



Algorithms 


iterative steps 


time consuming 


a* 


relative error 


Newton 


13 


1.04 


1.2502e-6 


2.6440e-7 


newl 


4 


0.38 


1.2502e-6 


2.6438e-7 


new2 


10 


0.99 


1.2502e-6 


2.6438e-7 



Table 1. Comparison of three algorithms at K = J = L = M = 35. 



In summary, the new algorithms for solving 2D integral equations of the 
first kind in this paper are stable and effective and newl may be used as a 
good solver. In the future, we will consider the 3-D case and the nonlinear 
case. 
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A Two-Grid Finite Difference Method 
for Nonlinear Parabolic Equations 

ZiTiNG Wang Xianggui Li 



Abstract 

A two-level finite difference scheme is given for the approximation 
of nonlinear parabolic equations. The analysis of the scheme is given 
by assuming an implicit time discretization. In this two-level scheme 
the full nonlinear problem is solved on a coarse grid of size H. The 
nonlinear term is expanded about the coarse grid solution and an ap- 
proximate interpolation operator is used to provide values of the coarse 
grid solution on the fine grid in terms of superconvergent node points. 
The nonlinear equations analyzed arise in the flow and transport pro- 
cesses of fluids in porous media and the analysis carried out applies to 
the numerical simulation of these processes. 



KEYWORDS: finite difference, two-grid method, nonlinear parabolic equa- 
tion, error estimate, superconvergence 

1 Introduction 

We consider a finite difference scheme for the nonlinear parabolic differential 
equation 



^-V-(K(a:,p)Vp) = /(7,x) 


in (0,T]xH, 




p(0, x) = p^(x) 


in H, 


(1.1) 


-(K(p)Vp) - v = g 


on (0, T] X r, 





where O is a rectangular domain in {d = 1,2 or 3) with boundary E, v 
is the outward unit normal vector on F, and AT : O x 72. — >■ is a sym- 

metric, positive definite second-order diagonal tensor, i.e., K = diag{ku),l = 

l,...,d. 

To avoid time-step constrains, it is often preferable to solve (1.1) implicitly 
in time. However, for fine meshes, the resulting large systems of nonlinear 
equations can be expensive to solve. To decrease the amount of work ne- 
cessary to solve (1.1), we consider a two-level method where the nonlinear 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 345-^^^ 2000. 
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problem is solved only on a coarse grid of diameter H and a linear problem 
is solved on a fine grid of diameter h H. On the fine grid, we approximate 
K(p) by a first-order Taylor expansion about the solution from the coarse 
grid. Thus, instead of solving a large nonlinear problem on the fine grid, 
we solve a small nonlinear problem on the coarse grid and a large linearized 
problem on the fine grid. 



2 A Coarse Grid Nonlinear Finite Difference 



Let for a positive integer s, be the Sobolev space bb|(n). Denote the 

inner product in Hg by 



{f,g)s 




D^f ■ D°^gdn, 



where f,g€ Hs{Ll). We denote by the space of functions whose pth 

spatial derivative is Lipschitz continuous. 

Let V = div) = {u G : V- v G L^(n)} and W = We 

denote the subspaces of V containing functions with normal traces weakly 
equal to 0 and g” as and V”, respectively. 

We consider two quasi-uniform triangulations of a coarse triangulation 
with mesh size Lf denoted by Tjf and a refinement of this triangulation with 
mesh size h denoted by r^,. We consider the lowest-order RTN space on 
rectangles. Thus, on an element B G t^, k = h or H, we have 

Vk{E) = {(aiXi + Pi, a2X2 + /? 2 ; 0:3X3 + P3Y '■ Oi, Pi G TZ}, 

Wk{E) = {a : a G 7 ^}. 



Define 1'° = H and Vp = V”' f]Vk- We use the standard nodal basis, 
where for Vk the nodes are at the midpoints of edges or faces of the elements, 
and for Wk the nodes are the centers of the elements. We define discrete 
products corresponding to the application of the midpoint (M), trapezoidal 
(T), and midpoint by trapezoidal (TM) quadrature rules, and denote the 
associated norms by || • ||r and the error in approximating an integral by 
the given rule by ET{q,r) = (q,r) — (q,r)T- The error in approximating an 
integral by either the trapezoidal or the trapezoidal by midpoint rule is 

\EQ{q,v)\ < C H 

Eerh |a|=2 
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For any </> G L'^{Ti), let 4>k denote the projection of (j) onto Wk] i.e., 

{4>, w) = ($k, w) \/w G Wk- 

This projection operator has the approximation property for (j) G 

U - (l)\\ < C\\(j)\\jy , 0<j<l, k=horH. 

Associated with the RTN mixed finite element spaces is the projection 
operator II : {H^{D)Y ^4, defined by 

(V • (Ilg), w) = (V • g, w) Vw G IFfe, 

with the approximation properties 

||g - ng|| < C\\qhk, ||V • (g - ng)|| < C||V • g||ifc. 

Furthermore, by the definition of Ilg and the midpoint rule of integration, 
we have that the error in the first component of the projection evaluated at 
the center of a grid block side is given by 

Using this estimate, we can bound the L°° norm of the projection by 

l|ng - g||i=o(n) < Ch\\q\\w^(Q.)- 

In the expanded mixed formulation of (1-1), we define the variable u = 
— Vp and u = K{p)u. The analysis uses the estimate 

IIUm" - + ||nu" - iff\\TM < Ck^hh- 

For the lowest-order RTN spaces on rectangles, for any g = {q^,q^) G 
and E G Tk, 

ll|(n,nitf(^, < \\%hHm. Il|;(n5)''lk.,« < ll^hnEy 

We uses the auxiliary variables, iff and w", defined by 
w" = -Vp”, m” = 7f(p”){t”. 

The problem is to find (p”, {!",«") G (IF x V x V) satisfying 

(p”, w) + (V • u”, w) = (/”, w) Vw G IF, 

(m”,u) = (p”,V-u) VvGF^, 

{u^ ,v) = {K{p^)vff ,v) Vu G F. 



( 2 . 1 ) 
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We choose cell-centered finite difference approximations G Wh, G Vh, 
and G Vf} to the functions p{t^, ■), u{t^, ■), and u{f^, ■), respectively, for 
each n= satisfying 

m.v)TM = {P^.y -v) VveV^, ( 2 . 2 ) 

(C/S, v)tm = (K(Ph(P}^))U^, v)t Vv G Vh. 

and we take P^ = pH{t^,-)- We define Vh{p) from the values of pij for 
i = 1, . . . , Nx and j = 1, . . . , Ny by the bilinear interpolation operator. If p 
is twice differentiable in space, for Vh{p) we have 

\\Vh{p)-p\\o.<chV 



Theorem 2.1 Assume /" G LF'{LI) for each n and K is continuously dif- 
ferentiable in its arguments. Then, for At sufficiently small, there exists a 
unique solution to problem ( 2 . 2 ). 

We make the smoothness assumptions 
(HI) /G Wi(0,T;L^(fI)); 

(H2) Kii{x,p) G C'i(H X 7^) n x U), I = I, . . . ,d, and Ku and 

are uniformly Lipschitz continuous in p; 

(H3) There exist positive constants if* and K* such that, for z G 'Rd^, 

if*||z|P < z*K{x,p)z < if*||z|p, X € it, p € TZ; 

(H4) pewl{0,T;C^’Hn)y, 

(H5) u,uG Wl{0,T;C\n)Yr\Wl{0,T;Wl{n)Y. 

Theorem 2.2 For each n = 1 , . . . , N , let {p^tU^hiIPh) G {Wh x V// x Vf}) 
satisfy 

(V-C/S,rc) = (&",w) VwGWh, 

{U,v)tm = {PI,V-v) VuGH°, 

(C/S, v)tm = (if (Pff (p"))C/ff , v)t Vu G Vh, 

with &” = f'^ — p'f and P% = P^. Then, under assumptions (H1)-(H5), 

IICCS - w"I|tm + \\ul - u"||tm + \\Ph-p"\\m < CH\ 
WdtPl-dtp-^WMACiH^ + Af). 
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We have the theorem about the convergence of the above difference scheme 

Theorem 2.3 Let P^, IJ^, and U^,n = be defined by equation 

(2.1) with the initial value = pnit^,-)- Assume that (H1)-(H5) hold. 
Then there exists a positive constant C, independent of H and At, such that 

N 

\\pS - P'^Wm + {AtJ^KM - < C{H^ + At). 

n—1 

3 Fine Grid Linear Scheme 



We now consider a linear cell-centered finite difference scheme on the fine 
grid where we make use of the nonlinear solution on the coarse grid. We 
solve the problem for PJ) G Wh, U]f G Vh, UJ) G n= 1, . . . , 

{dtPf:, w) = -(V • U)i, w) + (/” , w) \/w G Wh, 

{Uf^,v)TM = {Pj:,V-v), 

{U'f,v)TM = {K{VH{Pfj))Ufi,v)T, 

+ {K^{PH{Pfj))QHm){PH{Pf:)-PH{Pfr)),v)T Vn G Vh. 

We define Qh{u) as a vector quantity with entries Q^{u^) and 
The entry Qfj{iP) is defined from the values of ^ for i = 0, . . . , and 

j = as follows. For points {x,y) such that Xi_xji A x < Xj_|_i/ 2 , 

t G {1, . . . , iVa;} and Vj < y < yj+i, j G {1, . . . , Ny}, we take Q|f (u"') to be 
the bilinear interpolate of ^^+ 1/2 j> ^^- 1/2 ^i+i /2 j+i- This 

leaves a strip half a cell in height along the top and bottom of the domain. 
We consider the bottom strip. For i = 0, . . . IVa;) set 

QUun(Xi+^/2,yi/2) = W + if|)«?+l/2.1 - ^fK+l/2.2/(^f + Hf). 

For other points, an analogous interpolation definition is defined. We 
have the theorem about the convergence of the above linear finite difference 
scheme. 



Theorem 3.1 Let P]f , Uf^, and U]f , n = 1, . . . ,N , be defined by equation 
(2.2) with the initial value = ph{t^,-). Assume that (H1)-(H5) hold and 
that H and AtH~'^/‘^ are sufficiently small. Then there exists a positive 
constant C, independent of h, H , and At such that 

. 1/2 



\Ph-P^\\m 



' N 

AtJ2K4Uf-ii'^\\T \ < + h^ + At). 

. n=l J 
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4 Conclusions 

We have presented and derived error estimates for a two-level finite difference 
scheme for nonlinear parabolic equations and have shown and optimal-order 
convergence in both and for the coarse and fine grids. We remark that 
we have only considered the case of the Neumann boundary condition and a 
diagonal tensor K. 

The estimates derived in this paper use the inverse estimate to bound the 
L°°-norm in terms of the L^-norm. As a result, our estimates may not be as 
sharp as possible. However, no better L°° estimates exist at this time for the 
expanded mixed finite element method. 

The two-level scheme described above could be extended by adding levels 
and expanding about the next coarse solution in the nonlinear term at each 
new level. This corresponds to adding more Newton-like iterations with each 
iteration taking place on the next finer grid. This possibility is under investi- 
gation. We are currently implementing these two-level methods for equations 
of interest to flow in porous media. 
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A Compact Operator Method 
for the Omega Equation 

Francisco R. Villatoro Jesus Garcia-Lafuente 



Abstract 

The ageostrophic vertical velocity field for the mesoscale dynamics 
of the Alboran Sea is determined by means of synoptic data and nu- 
merical solution of the omega equation. For the stationary, horizontal 
geostrophic velocity field, a fourth-order, compact operator differentia- 
tion is used. The vertical ageostrophic flow is determined by means 
of a numerical solution of the omega equation with a Q vector formu- 
lation, solved by means of a fourth-order accurate, compact operator 
method. The numerical results confirm that on a macroscale, upward 
motion occurs upstream of the anticyclonic rotation while downward 
motion mesoscale takes place downstream. 



KEYWORDS: oceanography, ageostrophic flow, omega equation, compact 
operator method 

1 Introduction 

The geographical location of the Alboran Sea between the South of Spain and 
the North of Africa, receiving water from the Atlantic Ocean (AO) through 
the Strait of Gibraltar, makes its study of fundamental importance for the 
research of the superficial circulation on the Western basin of the Mediterra- 
nean Sea. An expedition made in July 1993 shows the picture: the Atlantic 
jet from the AO proceeds as a wave to the East generating two anticiclonic 
gyres where the water from the AO accumulates near the African coast, the 
so called West (WE) and East (EE) gyres, which are separated by Cape Tres 
Focas and the Alboran Island 0. The WE, which has also been found by 
other expeditions, is known to be permanent but there is some controversy 
related to the EE, which may be seasonal. 

The data for the temperature and salinity distribution found in the Al- 
boran Sea are synoptic, showing the general situation, but incapable to track 
the temporal evolution, and staggered, so these data have been Altered and 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 351-^^^ 2000. 

(c) Springer-Verlag Berlin Heidelberg 2000 
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interpolated on a regular grid as shown in Q , to study the stationary state of 
the fluid. From the experimental data the pressure/density ratio A</) \m?‘ / s^] 
and the Brunt- Vaisala frequency N [s~^] are obtained every 5 meters from 
2 to 202 m, in a regular grid of 10 x 10 km. In this paper these data have 
been used to obtain the stationary, horizontal geostrohpic velocity held in the 
Alboran Sea by means of a compact operator differentiation of fourth-order 
accuracy. 

The estimation of the vertical velocity yields information on the exchanges 
of heat and salt near the surface of the ocean and is responsible for the vertical 
transport and distribution of nutrients and plackton ^ . As direct measures 
of vertical velocities are difficult, the solution of the omega equation can 
be used. In this paper the quasigeostrophic omega equation laoi, valid in 
regions of low dynamics and Rossby numbers, is solved by means of a fourth- 
order accurate, compact operator method j0|. It is standard to close this 
numerical method using homogeneous Dirichlet boundary conditions |i|7|. 

When the horizontal velocities are smaller in one direction than in the 
other, the three-dimensional omega equation can be simplified to a two- 
dimensional omega equation for the stream function. A compact operator 
method has also been developed for this equation. Dirichlet boundary condi- 
tions have been used except for upstream and downstream, for which Roache, 
the null second-order derivative of the stream function, and Neumann, res- 
pectively, boundary conditions are used due to their better accuracy |^. 

In §2 , the equations relevant to the problem studied in this paper are 
presented. §3 is devoted to the presentation of the numerical, fourth-order, 
compact operator methods developed in this paper. §4 shows the main results 
obtained in this paper. Finally, the last section is devoted to the presentation 
of conclusions. 



2 Presentation of the Problem 

When the Coriolis forces approximately balance with pressure gradients, 
the flow field U = {U,V,W) can be separated in a geostrophic and quasi- 
geostrophic flow as 

U=Ug + Ua, V =Vg+Va, W=W = Wa, 
where |VF| < |C/|, \V\, |ua| < \Ug\, and |ua| < \Vg\. 



A Compact Operator Method 



353 



The geostrophic field can be calculated from the dynamic height using 
the expressions Pj: 



rr 1/ - 

® f dy ’ ^ f dx 



( 2 . 1 ) 



where f = 20. sin ■!/; is the local frequency of rotation of the Earth at latitude 
ip ~ 36° at the center of Alboran Sea and / = 8.55 x 10“® [s“^]. 

The steady-state omega equations for the quasi-geostrophic fiow are as 
follows 



d{N^ w) ^2 dUa d{N^ w) dVa dua , dva , dw ^ 

— ^ ^ ~ ^ ^ ^ ~ 

ax az ay oz ox ay oz 

( 2 . 2 ) 

where the potential vorticity Q is obtained from the geostrophic fiow as 



dVo dUa . dVa dVa 



dx dz dy dz 



= 2 f I » g _i_ » y riv — _o f / y y _i_ y y 



dUn dUa dUn dVo 



dx dz dy dz 



A simple omega equation for the vertical velocity fiow can be easily obtained 
using the continuity equation in (2.2), yielding 



■2^ 

dz"^ 



w 



/ d'^ d^ \ 

dy'^ ) 



{N^ w) 



dQ^ dQy_ 

dx dy 



(2.3) 



The boundary conditions for this equation are difficult to determine since 
the synoptic experimental data have been interpolated in a regular grid. 
The bottom and top of the grid are fictitious boundaries at 202 and 2 m., 
respectively, of depth. The south and north boundaries correspond to water 
and coast points. The west and east boundaries correspond to water incoming 
from the Atlantic ocean and water going out from the Alboran Sea. Since the 
detailed boundary conditions are very difficult to obtain, Dirichlet boundary 
conditions have been used for all the boundaries of the computational grid 
since Viudez et al. have shown they are enough accurate for this problem. 

Eq. (2.3) can be further simplified if the variation of the horizontal velocity 
in the a:-direction is smaller than that in the y-direction: 



dug 

dx 



< 



dVg 

dy 



By introducing the stream function 

dtp 

dz 



w = 



dtp 
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we have the simplified omega equation 



d 



dtp 



Similarly, when 



dy V dy 
dv. 



d^tp 






dz^ 



dy 



< 



dUa 



dx 



another stream function can be introduced 

dtp’ 

Ua = W 

dz 

yielding the simplified omega equation 



dtp' 
dy ’ 



d 



dx 



dtp' 






dx 



d'^tp' 



= Q". 



(2.4) 



(2.5) 



For the boundary conditions for the stream function equation, homogeneous 
Dirichlet ones have been used for the north, south, bottom, and top bound- 
aries, Neumann ones for the east, as considered as outflow, and Roache ones 
for the west inflow. When solving the stream function equation, Roache bo- 
undary conditions, i.e., the null second-order derivative, are more accurate 
than Neumann ones |^. 



3 Fourth-Order Compact Operator Method 



The calculation of the geostrophic field using eq. (2.1) requires the use of a 
differentiation formula. In this paper a compact operator method of fourth- 
order accuracy is used. This implicit method uses the difference expression 



lJ^ = 



dUj 

dx 



1 V + A 

2Axl-f (52/6 






z = 0, 1,...,A, 



(3.1) 



where the finite difference operators are defined as 



EU, = Ui+i, \7 = E-1, A = l-E~^, S^ = \/-A. 



Eq. (3.1) yields the linear, tridiagonal system of equations 



Ui-i + 4:Ui + Ui+i — {Ui+i — Ui-i), 



whose truncation error expansion is 

• _ dU^ Aa;4 dPU^ 

* dx 180 dx^ 



0{Ax^) 



(3.2) 
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showing its fourth-order accuracy. For the boundary conditions, asymmetri- 
cal formulas are used 

5t/o -f 16t/i - 3JJ2 = (MUo - 26 Ui - 2W2 + IOC /3 - C/ 4 ) , 

3Ax 

Wn + IWn-1 - 3Un-2 = ^ (44C/^ - 26Un-i - 27Un-2 

+10Un-3 — Un-a) , 

for the left and right, respectively, boundaries. These expressions have been 
selected since they yield exactly the same truncation error terms, up to the 
fifth-order included, as the inner formula, i.e., eq. (3.2). The solution of the 
corresponding tridiagonal systems is obtained by using the Thomas method. 

The second-order derivatives appearing in the elliptic equations (2.3)- 
(2.5) are also approximated using a compact operator method, e.g.. 



U,= 



d^Ui 



1 



<52 



c/^, 



dx'^ Ax“^ 1 -I- 52/12 
which yields the tridiagonal system of linear equations 

C/,_1 + 10 C/, + C/i+1 = ^ (C/,+1 - 2 C/, + C/,_i) , 



(3.4) 



whose truncation error term is 



Ui = 



d'^U, Ax^ dPU, 



0 (Ax®). 



(3.5) 



dx'^ 240 dx® 

When the compact operator approximation (3.4) is used for the inner points 
in the grid, different boundary conditions can be used. In this paper Dirich- 
let homogeneous boundary conditions are used for eq. (2.3) and Dirichlet, 
Neumann, and Roache homogeneous boundary conditions for both eqs. (2.4) 
and (2.5). For the Dirichlet boundary conditions, Uq = Un = 0, for the 
Neumann ones, eqs. (3.3) are used and for the Roache boundary conditions, 
the stencil is used at the left boundary: 



I6C/0 -f I5IC/1 - 
1 /771 

Ax2 4 



83C/2 




C/i + 




285 



U3 + 





and the same expression with Un-i replacing Ui for the right one. This 
expression has truncation error terms equal to eq. (3.5) up to the fifth-order 
(included). The substitution of eq. (3.4) into eq. (2.3) yields 

/2(12 -I- Sl){12 + + (12 -I- 52)(12 -|- 

+(12 + 52)(12 + 52)^ ( N ^^ kmjk ) = (12 + 52)(12 + J2)(12 + 52)+„+, 




356 



Villatoro and Garda-Lafuente 



where Wijk ~ w(iAx,j Ay, k Az) and homogeneous Dirichlet boundary con- 
ditions are used. For the calculation of the horizontal ageostrophic flow, 
eqs. (2.2), an explicit, fourth-order, Runge-Kutta method is used with homo- 
geneous Dirichlet boundary conditions, e.g., Uo(0) = 0 and Ua(0) = 0. 

Eq. (3.4) can be easily substituted into eq. (2.5) or into eq. (2.4), separa- 
ting this equation as 



dtp 

dy 






and yielding 

= + 

(12 + 61} ^‘^11 + /^(6 + = (12 + 



where <j)jk ~ 4>{j Ay,kAz), a Neumann boundary condition is used in the 
left (inflow) boundary, a Roache one in the right (outflow), and Dirichlet 
ones in the other four ones. The Va and w components of the ageostrophic 
velocity are calculated by differentiation of the stream function ip by means 
of the compact operator eq. (3.1); the remaining component Ua is obtained 
solving the first equation of (2.2), with Ua{0) = 0, by means of an explicit, 
fourth-order, Runge-Kutta method. 

The algebraic equations to be solved when the compact operator method 
is applied to eq. (2.3) have 15 non-null diagonals and 9 non-null ones for 
eqs. (2.4) or (2.5) and are well-conditioned. These linear systems have been 
solved by means of the stabilized bi-conjugate gradient method using incom- 
plete LU factorization as a preconditioner. The mean of required iterations 
is approximately 10 for a residual error treshold of 10“^°. 



4 Presentation of Results 

Fig. 1 shows the geostrophic flow field resulting from the experimental data 
available. These figures show the west gyre, which continue being visible for 
all depths, and the smaller east gyre, which apparently disappear as depth is 
increased. A zoom of the bottom right plot shows that the east gyre remains 
despite with very small velocities. The code has been verified checking that 
the conservation law for the momentum approximately conserves. 

Fig. 2 shows the ageostrophic flow field calculated using the simplified 
omega equation (2.5), based on the stream function formulation. Similar re- 
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suits have been obtained for the omega equation (2.4). Although the conser- 
vation of momentum approximately also holds for the ageostrophic velocity 
field (ua,Va,w), the results are not expected by physical principles applied 
to this problem; i.e., upward motion must occur upstream of the anticyclonic 
gyres while downward motion must take place downstream. The reason for 
this behaviour is that the hypothesis required to derive eq. (2.5) or eq (2.4) 
does not hold since the numerical results yield 



dUa 




dVa 


dx 




dy 



However, the results are encouraging since the velocities we have obtained 
have the correct orders of magnitude 

k| < |Va| « \Ua\ < \Ug\ « \Vg\, 

required for the validity of the complete omega equation (2.3). 

Fig. 3 shows the vertical velocity field for the solution of the complete 
omega equation (2.3). The conservation law also approximately holds for 
this numerical method assuring the improved accuracy of the fourth-order, 
compact operator method over a standard second-order one. The resulting 
velocity field has similar orders of magnitude than the one obtained with the 
simplified omega equation, e.g., 

|w| « 10“’^ m/s, |Ua| « |Ma| « 10“^ m/s, \Ug\Ki\Vg\Kilm/s. 

Fig. 3 shows the vertical velocity contours for a j/z-plane at x-positions 
corresponding to the two sides of both the West (top plots) and East (bottom 
plots) gyres. This figure and other figures not shown here show that upward 
and downward motion occur at the south and north directions, respectively, 
near the west side of the West gyre and the reverse, i.e., downward and 
upward motion at the south and north directions, in the east side of this gyre. 
Similar behaviour, but less remarked, is found in the East gyre. These results 
are not sufficiently confident so as to confirm the East gyre as a stationary 
phenomena but confirm that the numerical method yield physically realistic 
results. 

5 Conclusions 

A fourth-order, compact operator method has been developed for the calcu- 
lation of the geostrophic velocity field and the solution of both the simplified 
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Figure 1: Geostrophic flow field for depths 2, 62, 122, and 182 meters from top 
left to bottom right. 



(va, w) at center WG stream function 




y : South-North y : South-North 



Figure 2: Vertical velocity flow both (va,w) (left) and stream function isocontours 
(bottom)) for a j/-plane (South-North direction) trough the center of the West gyre 
in the Alboran Sea. 
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w (y=120 km.) w (y=200 km.) 




w (y=300 km.) w (y=320 km.) 




Figure 3: Isocontours of the vertical velocity flow at j/ 2 -planes at the west (left 
plots) and east (right plots) of both the West (top plots) and East (bottom plots) 
gyres found in Fig. 1. 

omega equation for the stream function and the three-dimensional complete 
omega equation. The resulting methods are highly accurate and allow an easy 
treatment of Dirichlet, Neumann, and Roache boundary conditions. The sy- 
stem of linear algebraic equations to be solved is well-conditioned and has 
been solved by means of the stabilized bi-conjugate gradient method using 
the incomplete LU factorization as a preconditioner. 

The numerical method developed in this paper has been applied to a set 
of experimental data which have been previously interpolated and smoothed 
in a regular grid. The geostrophic velocity field presents clearly both the 
West and East gyres as expected from previous studies. The ageostrophic 
vertical velocity field does not satisfy the hypothesis required for the use of 
the simplified omega equation based on the stream function. However, the 
results obtained in that case have the correct order of magnitude for the 
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ageostrophic velocities. 

The solution of the three-dimensional omega equation for the calcula- 
tion of the vertical ageostrophic flow field confirms the presence of the West 
gyre, since upward motion occur upstream of this anticyclonic gyre while 
downward motion takes place downstream, but is not confident enough for 
the East one. The only reason we can point out is that the interpolation 
and smoothing procedures used to obtain a regular grid from the staggered 
original experimental data are not correct when the velocities are small. 

Our results indicate that further experimental data are required to model 
accurately the vertical flow for the East gyre appearing in the geostrophic 
flow field. Also new techniques for the interpolation and smoothing of the 
experimental data must be developed. Finally, new numerical methods based 
on compact operator methods using staggered grids instead of regular ones 
must also be developed. 
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for Compressible Miscible Displacement 
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Abstract 

A Schwarz type domain decomposition algorithm is formulated to 
solve an approximation for miscible displacement of compressible fluids 
in porous media and convergence rate of the algorithm is analyzed. 
First, a splitting positive definite mixed element procedure is used to 
treat the pressure equation of parabolic type. The coefficient matrix of 
the mixed element system is symmetric positive definite and the flux 
equation is separated from the pressure equation so that the approxi- 
mate solution of the flux function can be independently obtained, and 
a characteristic finite element method is used to treat the convection- 
diffusion equations of the concentrations. Then a Schwarz type domain 
decomposition algorithm is introduced to solve the symmetric positive 
definite systems step by step. How many iterative cycles are needed at 
each time level? A convergence analysis is given. 



KEYWORDS: compressible flow, miscible displacement, domain decomposi- 
tion, characteristics, mixed element, convergence analysis 

1 Introduction 

The displacement of multi-phase multi-component compressible flow in porous 
media is governed by a nonlinear system consisting of a parabolic equation of 
pressure of the mixture and convection-diffusion equations of concentrations 
of fluids. To illustrate the method, we consider as our model the single phase 
and two component displacement of one compressible fluid by another in a 
porous medium. Let be a convex bounded domain in with a boundary 
r. Under the assumptions that no volume change results from the mixing of 
the components and that a pressure-density relation exists for each compo- 
nent in a form that is independent of the mixing and that the components 
are of slight compressibility, Douglas and Roberts proposed a mathematical 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 362-^^^ 2000. 
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model POl (with x € fl, 0 < t < T) 

(a) d{c)pt + y -u = q, u = -a{c){Vp + 

(b) 4>ct + u • Vc — V • (P(u)Vc) + h{c)pt = (c — c)q, 



where c = ci = 1 — C2 denotes the (volumetric) concentration of the first 
component of the fluid mixture, q the volumetric rate of external flow, c 
the concentration of the first component in the external flow, which must be 
specified at injection points {q > 0) and be assumed to equal c at production 
points {q < 0), and zi and Z 2 the constant compressibility factors for the 
j-th component, i = 1,2. k = k(x) and (f> = (j){x) are the permeability 
and the porosity of the rock, respectively, p, = p{c) is the viscosity of the 
fluid, 7 = 7(c) is the gravitational coefficient and H is the depth function 
of oil reservoir, p is the pressure of fluid and u is Darcy’s velocity. D{u) = 
4i[dml + |u|(d;ifl(u) + dtf-'-(u))] denotes the diffusion matrix describing the 
effects of molecular diffusion and dispersion where f(u) = {uiUj /\u\^) 2 x 2 is 
the 2x2 matrix representing orthogonal projection along the velocity vector, 
£’'^(u) = X — £’(u) is its orthogonal complement, and dm, di, and dt are the 
molecular diffusion and dispersion coefficients, respectively. 



x(c) = -^, d{c) = 4>y^ ZiCi, b{c) = 4>{zi- Z 2 )c{l- c). 

Kc) ^ 



We also assume that no flow occurs across the boundary: 



u • = 0, {T>{u)\7c) ■ n = 0 on F, 0<t<T, (1.2) 



where v is the unit vector outer normal to F. In addition, the initial conditions 



p = Po, c=cq in n, t = 0, 



(1.3) 



must be given. 

Many works on approximations for the displacement of multiphase multi- 
component flows have been done; for instance, see Douglas PEEin], Douglas, 
Ewing and Wheeler dlS], Duran m, Yuan m, and Yang m for time- 
discretization procedures of miscible displacement of incompressible fluids 
in porous media by a modified method of characteristics combining with the 
classical mixed finite element methods, and see Douglas and Roberts uni and 
Chou and Li jOj for numerical methods for a model for compressible miscible 
displacement in porous media. 
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In |21 j . we formulated a new approximation based on a modified method 
of characteristics combining with a splitting positive definite mixed element 
procedure. We use a splitting positive definite mixed element to treat the 
pressure equation of parabolic type. The coefficient matrix of the mixed 
element system is symmetric and positive definite and the flux equation is 
separated from the pressure equation so that we can independently obtain 
the approximate solution of the flux function and then, if required, obtain 
the approximation of the pressure almost explicitly. We use the characteri- 
stic finite element method to treat the convection-diffusion equations of the 
concentrations. 

The purpose of this article is to formulate a Schwarz type domain de- 
composition algorithm to solve the approximation described in m and to 
analyze the convergent rate of the algorithm. It is clear that if iterative cycles 
are indefinitely implemented at each time level, we can obtain the solution of 
the fully discrete systems. But indefinite cycles are impossible and unneces- 
sary. How many cycles are needed at each time level? We give a convergence 
analysis. 

The article is organized as follows. In §2, we summarize the numerical 
method for the compressible displacement problem (1.1) based on a modified 
method of characteristics combining with a splitting positive definite mixed 
element procedure. In §3, we formulated a Schwarz type domain decomposi- 
tion algorithm to solve the fully discrete scheme defined in §2. In §4 we give 
the error estimate of the approximate solution. 



2 A Characteristic Mixed Element Procedure 



It is easily seen that the equation (1.1(b)) is convection-dominated and direc- 
tly depends on Darcy’s velocity u. A better numerical result can be obtained 
by using mixed element methods to approximate directly u and characteri- 
stic methods to treat the convection term. We assume that the coefficients 
a(c) and d{c) in system (1.1) are below bounded positively as in ^Dj. In- 
troduce the space H — {w S (L^(H))^; V • w G L^(H), w • z/ = 0 on T}, 
S = and A4 = Let a(c) = l/a(c) and /3(c) = l/d(c). The 

system (1.1) is equivalent to the following characteristic mixed weak form 
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that seeks (c,u,p) G S x H x M such that 



(a) (V'c^, z) + (2?(u)Vc, Vz) + {b{c)f3{c){q - V • u), z) 



= {{c-c)q,z), VzG5, 

(6) ((a(c)u)t,w) + (/3(c)V • u, V • w) 



( 2 . 1 ) 



= (/3(c)g, V • w) - (7(c)tV77,w), V w G "H, 
(c) (pt,v) = {f3{c){q-V -u),v), yvGM, 



where ip = \/ (jA + |up and 




( 2 . 2 ) 



denotes the direction derivative along the characteristic line of the convection 
term (pct + u • Vc. 

Make a time partition 0 = tg < ... < ... < t^ = T. Set r„ = 

tn — tn-i- Take r = maxi<„<7v as a time step size. Let Th^, Thu’ 
be three families of quasi-regular finite element partitions of the domain fl, 
and he, and hp be the mesh parameters, which generally denote the largest 
of diameters of elements in the partitions, respectively. Let TLhu CTLhe each 
of the classical mixed elements defined on the partition 7)i„, such as BDDM 
0, BDFM 0, BDM 0, CD 0, and RT elements in PS|. Let Sfiu C S and 
Mhp C Ai he usual finite element spaces defined on the partitions Th^ and 
Thp- Based on (2.1), we define a characteristic mixed element scheme. 

Characteristic mixed element scheme. Given an initial approxima- 
tion (c^,u)(,p51) G Sh^ X T-Lhu X Mhp such that 



(а) {clZ) = {co,Z) VZGShu, 

(б) (a(co)u°,W) = -(Vpo+7(co)Vif,W) V W G (2.3) 

(c) {plV) = {po,V) yVGMhp, 



seek c'fi G Sh^ such that 




where g" = max(0, g") and c\ 



-n-l 



is given by 




X = X - 



,n— 1 



{x)/4>{x) V a; G D; 



(2.5) 



366 



Yang 



seek uJJ S 'Hh^ such that 

(a(cJJK, W) + r„(/3(cJJ)V • <, V • W) 

= (a(cri)ur\ W) + r„(/3(c^)g-, V • W) (2.6) 

-((7(c)()-7(cr'))Vi?,W) 

seefc € J^hj, such that 

{pl,V) = {pl-\V)+T^{P{cl){q'^-V-vLl),V) yVGMh^, (2.7) 



for n = 1, 2, . . . , N. 

Because point x may lie outside domain f2 , we must continuously ex- 
tend the value of function to the exterior of fl. Define the operator Q as 
a projection from the exterior domain = R^\fl to boundary T and the 
point tF(x) as the symmetric point of point x S corresponding to the 
projection point Q{x). We say a domain Q to satisfy the property A, if 
there exists a constant 5 > 0 such that for each point x G where 

fls = {x\ inf j,go \x—y\ < 5}, the projection point Q{x) is uniquely determined 
and tF(x) G In this case, we can define an extension by 



ip{x) 



p{x), X G 

(p{tF{x)), X G 



( 2 . 8 ) 



(2.8) defines the linear bounded extending operator from iJ^(n) to 
A domain satisfies the property A, if its boundary F is smooth. In the 
case of convex polygon, the domain fl satisfies also the property A. After 
the function is extended through use of above-mentioned method (2.8), 
is defined for sufficiently small r„. 

It is easily seen that the coefficient matrix of the mixed element system in 
schemes (2.6) and (2.7) are symmetric positive definite and the flux equations 
(2.6) on the flirx variable is separated from the pressure equation (2.7) 
on the pressure function The matching relation (i.e., the BBL-condition) 
between the mixed element space Mhp and required by the classical 
mixed element spaces in 1313111 El ESI now is not necessary. The flux function 
\Yf can be solved independently and then, if required, the pressure function 
pf^ can be obtained almost explicitly. 

From the viewpoint of calculation, one can choose usual continuous finite 
element spaces as T-Lh^ ■ But usual continuous finite element spaces, in general, 
cannot yield an approximate solution with optimal accuracy in L^-norm. To 
get the optimal approximation, we still choose the classical mixed elements as 
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TLh^ ■ The usual continuous finite element spaces may be used as the pressure 
function finite element spaces Aihp- It is clear that the number of unknowns 
in systems (2.6) and (2.7) is less than the number of the unknowns in the 
classical mixed element systems. Since systems (2.6) and (2.7) are symmetric 
positive definite systems, many fast effective algorithms can be used. 

We assume that the finite element spaces 'Hhp and Mhp have appro- 
ximate properties that there exist integers fci > fc > 0, r > 0 and s > 1 such 
that, for any 1 < q < oo 

l|w - W||(i,(n))2 < 7s:/i^+i||w||(^yfe+i,,(n))2 

W E 

irf ||V-(w-W)||i,(n) < 7s:/rJ)H|w||(w'=i+i.<!(n))2 

W E ^hu 

(b) inf \\z- Z\\L<i(n) < Khl+^\\z\\w‘+c,,^n)y z G SC\W^+'^’‘i{n), 

(c) \\v-V\\L<i(^a)<Kh^p+^\\v\\w-+c<i(n)'i vGMC\W^+'^’‘i{n), 

(2.9) 

where fci = fc -|- 1 in the cases that TLhp is each of BDFM and RT mixed 
elements and A:i = fc in the cases that TCh^ is each of BDDM and BDM 
mixed elements. Both cases are included in the CD mixed elements. In EH, 
we proved the convergence result. 

Theorem 2.1 Suppose that the finite element space Hh^ is each of the clas- 
sical mixed elements described in HElllEllIBI and that the coefficients in 
system ( 1 . 1 ) have the first and second order continuous derivatives and the 
solution of system (1.1) is smooth. If the mesh parameters he, and r 
satisfy the relations 

(a) hu = o{hc) {k = 0), hu = 0{hc) {k > 1), 

(b) T = o(he), T = o{hu), 
then the a priori error estimates 



( 2 . 10 ) 



(a) 



(b) 



max lie” — c7|| r 2 |'Q'i -I- max | 
< K{hl+^ + ht^ + r}. 



i/tll(L=(n))2 



( 2 . 11 ) 



max Up" — 

Kn<N 



L2(f2) 



< K{h;+^ + hi+^ + h>f^ + t} 



hold, where the constant K only depends upon some norms of the solution of 
system (1.1) and time T. 

In the next section, we formulate a Schwarz alternating domain decom- 
position algorithm to solve the systems (2.4) and (2.6). 
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3 Domain Decomposition Algorithm 

In this section, we focus on the study of the multiplicative Schwarz iterative 
algorithm for solving the systems (2.4) and (2.6). For sake of convenience, 
we assume that Th^ = Th^ = Th and k = s. We assume that a set of 
overlapping sub-domains are given, whose boundaries are aligned 

with the mesh of partitions Th, ^ = Uti Let Th,i be the restriction of 
Th in ^i, and Hhi^i) and Sh{^i) defined on Th,i be the restriction of Hh and 
Sh in fli, respectively. Define 'Hhfii^i) = {W G "H/j(n);W = 0 in 
and Sh,o{^i) = G Sh', Z = 0 in We assume that the domain 

decomposition satisfies the basic condition 

Condition A. For each x € Cl, there exists an open domain and 
j G {1,2,..., Q} such that x G and p| il C ilj. 

We can define a Schwarz alternating iterative scheme to solve the system 
(2.4) and (2.6) step by step. 

Alternating Iterative Scheme. Given an initial approximation (C°, U°) 
G Sh„ X such that 

(а) {C^,Z) = {co,Z) yZGSh^, 

(б) (a(co)U°,W) = -(Vpo + 7 (co)VII,W) VWG?t„„, 

for n = 1,2, . . . , N , seek (C”,!!") G Sh^ x by the following iterative 
method. Seek C” G Sh„ in the three steps: (1) Let Cq = C"“^; (2) Find 
CC^Q+i ^ for i = 1, 2, ... Q, such that 

z) + t„(P(U"-1)VC;q+„ VZ) F Z) 

= Z) + r„(6(C”-i)/3(C”-i)(V • U”-i - g”), Z) (3.2) 

+T„{qiir,z) yzGShpm, 

ch+^ = ch+^-l (3-3) 

where (7”“^ is given by 

= C'"“^(x), X = X — TnlJ^~^{x)/4i{x), y X G Lt, (3.4) 

/orj=0,l,...,m-l; (3) Let = 

Seek U” G Th,, in the three steps: (1) Let Uq = U"“^; (2) Find U”q_|_^ G 
Th, for i = 1,2, . . .Q, such that 

(a(C")UjQ+„ W) + r„(/3(C")V • V • W) 

= (a(C'”-i)U”-i, W) -h r„(/3(C”)V, V • W) 

-((7(C") - 7(C"-1)VH, W), V W g Th„,om, 



(3.5) 
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= u^Q +,-1 (3.6) 

for j = 0, 1, ... ,m — 1; (3) Let U" = where m denotes the iterative 

times at every time step. 

According to the general theory of the Schwarz alternating algorithm 
(see ^lElEI)) the iterative sequence {(C^Qj converges to the 
solution (c)(,u)() at each time step as iterative times m tends to infinity and 
under the condition A. But an infinite iteration is impossible. We want to 
know the convergent rate of the iterative sequence {(C^g, U” so that 

we can stop the iteration at the certain accuracy. On the other hand, we hope 
that the iterative sequence {(O^g, converges as fast as possible 
so that we can implement very small iterative cycles to obtain very good 
approximate solutions. 

In next section, we state the convergence theorem. 



4 Convergence Analysis 

In this section, we analyze how the convergent rate of the iterative sequence 
depends upon the mesh parameters h and r under some 
stronger conditions than the condition A, which are practical and reasonable. 
Condition B. The sub-regions can be divided into four parts: 

D,= Y. J = l,2,3,4, ro = 0, r^ = Q, (4.1) 

where subdomains in Dj are disjoint, {Di,D 2 }, {D 3 ,Z? 4 } and {Z)iljZ? 2 , 
DsijDi} O'T'e the domain decomposition of Di\J D 2 , D 3 (J Z ?4 and LI, res- 
pectively, which satisfy condition A. 

Condition C. The sub-regions can be divided into k parts: 

= X! J = ?'o = 0, r,^ = Q, (4.2) 

rj_i + l<2<rj 

such that 

(1) is a domain decomposition of LI satisfying the condition A 
and Dj p| = 0 for I ^ j — l,j -\- 1. 

(2) For each 1 < j < k, {^iYiLr^ 1+1 ® domain decomposition of Dj 

satisfying the condition A and p| = 0 for I ^ i — l,i -\- 1. 

Conditions B and C can be satisfied easily. 
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Theorem 4.1 Assume that the domain decomposition satisfies the condition 
B or condition C. Let (c”, u”) he the solution of the system (1.1), (C”, U”) be 
the solution of the alternating iterative scheme, and m be the iterative times 
at each time step. Then there exists the constant K, which is independent of 
h, T and m, such that 

3 ) 

< K{h^+^ + T + T~i{T + h)T}. 

5 Conclusions 

From the a priori error estimate (4.3), we see that the error results from 
two parts, one from the approximation error of the finite element function 
spaces and time discretization, which is bounded by + r), and other 

from alternating iteration, which is bounded by 0 (t“^/^(t + ft,)™/"*). To get 
an approximate solution with an optimal global accuracy, infinitely iterative 
cycles are not needed. Generally, if r = 0{h^~^^) and m > 6(fc+ 1), then the 
global error is bounded by + r). This means that only 6(k+l) cycles 

are required. 
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A Boundary Element Method for Viscous 
Flow on Multi-connected Domains 

Dequan Yang Tigui Fan Xinyu Yang 



Abstract 

A viscous flow problem in multi-connected domains is studied using 
a boundary element method. The results obtained are compared with 
those in [3] using a finite element method. These results show that 
the method used can treat the viscous flow with many different kinds 
of boundary conditions. This method is convenient and faster than 
others. 



□ 

KEYWORDS: boundary element method, multi-connected domain, viscous 
flow 



1 Introduction 

In practical viscous flow problems in multi-connected domains such as porous 
springs, the piping and well drilling distribution in oil extraction is very im- 
portant for engineering designs and is a very difflcult problem to solve. Finite 
difference methods face many difficulties because of complicated boundary 
conditions. Finite element methods can treat many different boundary con- 
ditions, but they need lots of input messages, occupy many RAMs, and take 
a lot of time. Due to the nonlinearity of equations in fluid mechanics and 
non-selfadjointness of differential operators, relevant variational functionals 
hardly exist. For this reason, the finite element methods based on the method 
of weighted residuals such as Galerkin’s approach, least squares, and colloca- 
tion methods [2, 1] do not possess any physical meaning. They are nothing 
but a pure optimum scheme for numerical treatment. Using newly developed 
boundary element methods [4, 5], we study the above mentioned problems 
in this paper. change multi-connected domain viscous flows into nonli- 
near boundary mXegration equations of the boundary velocity and boundary 
pressure, then discretize bougl^y integral equations, and Anally change the 
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velocity and pressure differential into the differential of the fundamental solu- 
tion. This treatment that the differential of the unknown variables is changed 
into that of the known variable makes the problem easier to solve. The bo- 
undary variable is determined singly. After determining boundary points, 
any inner points can be calculated. Other methods are difficult to do that. 
In calculating nonlinear problems, creating grids is easier than finite element 
methods. Integral equations are discretized in algebraic equations. Using 
iterative methods, the unknown velocity and pressure are calculated. It is 
important that the velocity and pressure is calculated separately. Comparing 
with [3], this method is more effective. 



2 A Fundamental Equation 

□ 

Let the multi-connected domain be 0 with boundary T = Tti T E 

* j 

1,2, . . . ,t, j = 1, 2, . . . , s) . The known velocity and pressure on boundary 
and Tti are shown in Fig. 1. In U, the dimensionless form of incompressible 
viscous fiuid is 



V •T(U) = V- (UU), V-U = 0 inU, (2.1) 

with the boundary conditions 

T-n\rti = ti, j = l,2,...,t, U|r„,- = Vj, j = l,2,...,s, 

where T(V) is the stress tensor corresponding to V and ft is the outward unit 
normal to F. 

If and are fundamental solutions of Stokes’s equation and Tij{W^) 
is the stress tensor corresponding to lU^, we can obtain the integral equation 

C{X)vk{X)= f^n,(Xo)T,,(W^(X - Xo}ov,(Xo)dTo 

+ f^n,{Xo)v,{Xo)vj{Xo)w^iX-Xo)dTo 
- /p n,{Xom,{V{Xo))ow^{X - Xo)dTo 

+ In '^i(^o)wj^i(X — Xq)vj (Xo)dfio, ^ = Ij 2, 3, 

where 



C(X) = 1 in U, 1/2 on F, and 0 not in fi. 
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Formulas determining pressures are 

P{X)= §^n,{Xo){q^{X - Xo)[vi{Xo)v,{Xo) - T,^(V{Xo))o] 

+ - Xo)v,{Xo)}dro (2.3) 

(2.2) and (2.3) are the fundamental equation treating a multi-connected do- 
main. 




['i^ J . nnijJt! Hiniicctcfl .^tnmain 



rrir,rigk grid 



3 Discretization of Boundary Integrals 

Change F into N elements (for 2D) or N polygonal elements (for 3D) F/j, /? = 
1, 2, . . . , A^. Change the flow region fl into E elements fdj, 7 = 1, 2, . . . , if. 
The velocity and pressure on F^ are replaced by that at the center of the 
figure. The velocity of is also replaced by that at the center of the figure. 
Formulas for discretization are given by 

CaVka= LkijafS'lT'ifSVjff — X)/3j ^kjaf3tj/3 

+ MkjafSnifjVipVjp — NkijajVie^Vje^t 

where a = Fi,F 2 , . . . ,Fat, ei, 62 , . . . , e_e, f3 = 1,2,. ..,N, 7 = 1,2,...,F;, 
i,j,k= 1,2,3, 

Lkijo.f3= f T,,{W\X^-Xo))odTo, 
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^kja.l3 — 


[ w';{x^-Xo)dro, 




Jr0 




[ wY^ix^ - Xo)dn^ 

e-y 



C-y 



When is a small segment element, is a triangle element (shown in 
Fig. 2). Three sides of the triangle are as follows: 

y = yi + ki{x - xi), y = y 2 + k 2 {x - X 2 ), y = ys + ksix - X 3 ). 

The integrals of an arbitrary function f{x,y,Xa,ya) on the triangle region 
are 



f ^ py 

/ f{x,y,Xa,ya)dVt = j dx / 

J 7 -t X I 0 



pyi-ki(x-xi) 

dx / f{x,y,Xa,ya)dy, 

1=1 •' do 

where Z = 1, 2, 3 and m = 2, 3, 1. Analytic forms of geometric coefficients can 
be obtained. 



4 Numerical Solutions 

After calculating Lkijay, and Nkija-y, equation (3.1) can be solved by 

using an iterative method. The pressure tj /3 on T„ and velocity on T* can 
be calculated first. We have N x N linear equations 

2^fca “ S/3in Lkijaf3'nii3V^'^ — J2j3j ^kja/Stjj^ 

+ S/3ij ^kjaynij^vli^Vji^ — 

where k = 1, 2, 3, a = Ti, T 2 , . . . , Tat, and 

'^ka— J2/3ij d^kija/SnijSVjjS — 'Yhpj MkjafftjiS 

Y '^pij d^kja^V^i^Vji^ni/s — dYkija^Vig^ . 

The boundary stress and velocity are given by boundary conditions. 

5 An Example and Conclusions 

As an example, to compare with [3], we calculate the viscous flow in a rectan- 
gular cavity having a rectangular obstacle. The rectangular cavity is 10 x 12 
and rectangular obstacle is 1.7 x 1.7. The boundary conditions are 
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Ftg3 {[o'*' field disiribuiidR 



Fig^ vrlcx:iiy dtsij ibuiion 



Using a triangular grid, the viscous flow is calculated when = 30, 40. The 
flow fleld distribution is shown in Fig. 3. In Fig. 3, a big vortex is formed 
on the top of cavity and inclining to the left, two small vortexes are formed 
on both angles of the cavity. Our results agree with [3] well. The velocity 
distribution in the x direction in vertical centerlines and the y direction in 
horizontal centerlines are shown in Fig. 4. Its changing laws agree with 
experimental data. For accelerating convergence, we u|^ a operator 

4a'=4a + 0.6K+l-uL). 

The resulting scheme is effective. 
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Nonlinear Convection-Diffusion Problems 
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Abstract 

In this paper we construct a characteristic difference method for 
two-dimensional nonlinear convection-diffusion problems. These pro- 
blems arise in the modeling of fluid flow and transport in porous media, 
for example. The method is analysed mathematically and an error bo- 
und is derived. Convergence of the method can be achieved under a 
milder restriction in the temporal stepsize and spatial stepsize than 
those required by existing methods. 



KEYWORDS: convection, diffusion, characteristic, difference method, con- 
vergence, error estimate 



1 Introduction 

Nonlinear convection-diffusion problems arise from many real world problems 
such as underground percolation, air-pollution. Hows, and chemical diffusion. 
Various characteristic finite element and characteristic finite difference me- 
thods P El E| have been developed to solve the convection dominated linear 
problems. However, the convergence of those methods requires At to be in 
the same order as h, namely At = 0{h). Duran |2] improved the methods to 
allow At to be in the higher order of h. But his analytic way of convergence 
can not be applied into the nonlinear problems. 

In this paper, we construct two characteristic difference methods based on 
the linear and quadratic interpolations for two-dimensional convection domi- 
nated nonlinear convection-diffusion problems. The methods are then analy- 
sed mathematically and formulas for the estimation of errors are developed. 
The problem considered is the two-dimensional nonlinear convection-diffusion 
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problem with the Dirichlet boundary conditions: 

c{x)^+b{x,u) ■ Vm- [^(ai(x,u)f^) + §^{ai{x,u)^)] 

= f{t,x,u) inf]x(0,T], (1.1) 

u{t,x) = Oon dfl X (0,T], u(0,x) = uq(x) in fi, 

where H is a rectangular domain, dfl is the boundary of 12, x = (x,y), 
h{x,u) = (6i(x, m), 62(0:, u)), and ai{x,u) {i = 1,2) are much smaller than 
c{x) and b{x, u). 

Assume that the solution of equation (1.1) exists and is unique and the 
coefficients c, b, ai {i = 1, 2) and initial function uq satisfy the following 
conditions: 

b) ||'Wo||L~(n) < Ki, ||u||L~(o.T;L°°(a)) < Ki- 

ii.) For V(x,p) G 12 x [— 2ATi, 2ATi], there exist 0 < ci < c(x) < C2, 
\H^,P)\ < K 2 , ai{x,p) > ao > 0, t = 1,2. 



iii.) The first order partial derivatives of bi{x,p) and ai{x,p) are bo- 
unded and f{t, X, p) is Lipschitz continuous with respect to p G 
[-2X1, 2X1]. 

Here Ki, K 2 , ci, C2, and ag are positive constants, 1F™’^(12) denotes the 
Sobolev space of m degree on 12, and W^’°°(0,T; = {v, 

[0, T] -)> VF™’P(12) and ||u||w'*.°°(o,T;W”*.p(a)) < +oo> s = 1, 2, . . . , fc}, where 



u| |rv«'“(0,T;W'"'P(O)) = max ess sup 

0<i<s 0<t<T 



d^v 



For simplicity, in the numerical analysis, let A(0,T;Z(12)) = A(Z(12)), 
Mi {i = 1,2,...) denote bounded positive constants, and M(si,S2, •■•,Sr) 
be a positive constant depending on si, S2 , . . . , Sr- 



2 Characteristic Difference Methods 

Let 'ip{x, u) = sj c(x)^ -I- |6(x, u) p = c{x)^ + bi{x, u)^ + 62(2^5 u)^ and t(x, u) 

denote the characteristic direction of the operator c{x) ^ +b{x, u) -Vu. Then 

du c{x) du b{x, u) 

dr 'ip{x,u) dt ip{x,u) 

Along the characteristic direction, equation (1.1) becomes 

ip{x,u)^ - [|^(ai(x,u)ff) -h ^(ai(x,M)f^)] = f(t,x,u), 
u(t,x)=0, u(0, x) = Uq(x). 



( 2 . 1 ) 
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Let flh denote a mesh partition of the domain fl, Xq and Yq be the side 
lengths of ^ and /12 = ^ be the spatial stepsizes, At = -^ be the 

temporal stepsize, and t„ = nAt (n = 0, 1, . . . , N). To solve (2.1) numeri- 
cally, approximate the derivative along r by the difference quotient at point 
(xi,yj,t„), namely 



where Xij = {xi,yj), Xij = Xij -b{x,j,u2j)At/c{xij), c{x,j) = c{xi,yj), 
h{xi,j,ulj) = bk{xi,yj,ul^) {k = 1,2), u^j = u{xi,yj,tn), and = 

u{xij, t„_i). Further, by intruducing 






n _ n 



hi 



hi 



the partial derivatives of u with respect to x and y in (1.1) can be expressed 

by 



bx,-{o,i6x,+u)'2j — hi [fli i_|_ 1 (M"_)_iy 
5y-{a25y^+u)lj = /i^^[a2.i-Hi«+ij 

where = ^[ai(xij,u2j)+ai(x^+ij u 

+a2{xi+i^i, <+i_j)]- 

Assume that is known and let 



- - <-iy)], 

- <i) - a2.i-i(M"i - 

”+i,i)] anda2,i+i = 5 [a2(xiy , u”^-) 



= Xi^j - b{xij,UJ'j '^)At/c{xij) 



( 2 . 2 ) 



be an approximation value of Xij, where Xi^ and Xij refer to quantities at 
tn but for simplicity, we omitted the superscript n. 

Let U'^~^{x) be the piecewise interpolation function obtained from {U^J^} 
on the spatial mesh and = C/"“^(iij); then a characteristic diffe- 

rence scheme for the solution of equation (2.1) is 



Ci, 



Ki-K 

' Ai 



- [5x,-Wx,+U)'^i + 5y^.{A25y^+U)^A = f{tr,,x 



jjn- 



t = 1, 2, . . . , Ji - 1, j = 1, 2, . . . , J 2 - 1, n = 1, 2, . . . , A, 

^ ^ I 5 2 , . . . , t/i 1 , j 1 , 2 , . . . , (/2 1 , 

c/" = 0, G an 

(2.3) 

where = b[ai(xij, L^”^"^) -k ai(xi+ij, [//(("ib)] and A 2 _i+i=i[a 2 (x,j, 

UlJ^) +a2(x.+i,„ t/f+lb)]- 




A Characteristic Difference Method 



381 



Remark 1. The error of the characteristic difference scheme (2.3) on the 
time direction is of order 0{At). 

Remark 2. If the interpolating function is piecewise bilinear, 

(2.3) is called a linear characteristic difference scheme. If is piecewise 

biquadratic, (2.3) is known as a quadratic characteristic difference scheme. 

To increase the accuracy of time integration, we discretize the first term 
of (2.1) by 






>i,j 



^ ^ Pit 



1p{3 






= c(a;*j)- 



At 



where = Xij - b{xij ,uf j)At/c{xij), Xij = Xij - 2b{x^j,u2j)At/c{xij), 

= u{xij,tn), and u^ j = u{xij,tn)- This scheme has the second-order 
accuracy on the temporal direction. 

Assume that {17”^”^} and {C/"“^} are known and U^~^(x) and U"‘~^(x) 
are obtained by the piecewise interpolation bilinear or biquadratic functions 
of {U^J^} and on the spatial mesh O/j, respectively. Then we obtain 

the multistep characteristic difference scheme for (2.1) 



§(7”,--2(7”7^ + iC/ 



'"1,3 



% - %AAi5.,+U)l^+^AM5y,+U)l^] 



= f{tn,x,^j,ec 1), i = 1,. . . , Ji - 1, j = 1,. .. , J 2 - 1, n = 2,...,N, 
^i,j ~ i = 1, 2, Ji 1, j = 1, 2, . . . , J 2 1, 

= 0, s,,, G dfi, 



: n-2 



(2.4) 



where — U (^Xi^j ■; tn—l') j ^ i^j — J ^n— 2) J 

0”-i) At/c(s.,,), k,j = Xi,j-2b(xi,y,0l-^) At/c{xij), 0.,, = 2U^-^-U^-^ 
^l,i-|- 5“2 and A 2 = 2 [®2 ) 

-|-a2(xij+i, 0ijh)]- 

With (2.4), we obtain C/”^- for n = 2, 3, . . . while {U^j} can be obtained 
by the Taylor expansion combining with equation (1.1), namely 



U}j = uo{xij) + 



c(Xij) 



'ip{xij,uo{xij)) dt 



,du. . 
(o7)oAt, 



where ||u^ — U^\\ = O(Aff). 

Similar to the characteristic difference scheme (2.3), (2.4) is known as the 
multistep characteristic difference scheme. 
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To analyse the convergence property of the characteristic difference me- 
thods (2.3) and (2.4), we define the inner products and norms for the mesh 
functions Y and Z on as follows: 



(y, z) = yijz,,jh^h2, iirip 

= |[r|p 

{Y,z] = ^t\yJo~' ||r]p 
[Y,z] = 'Etj^lyi,jZi,jhih2, |[y]p 



(Y,Y), 



(Y,Y), 



(Y,Y), 



3 Convergence 

of the Characteristic Difference Scheme 



In this section, we give the convergence analysis of the singlestep characte- 
ristic difference scheme (2.3). The convergence of multistep characteristic 
difference scheme (2.4) can be proved similarly. 

Let u{x,t) be the exact solution of (2.1) and be the approximate 
solution obtained from the characteristic difference scheme (2.3). (2.1) is 
discretized at (xij,tn). We have 



yj} . — ^ 



At 



where ft”,- ^ = u 
Let C" = u” 



- [4,_(ai4,-HM)”j + 5y,-{a25y,+u)l^\ = /(t„, u”^-) -h e^ 

(3.1) 

= u{xij^,tn-i) and efj is a local truncation error. 

— U'^. Then from (2.3) and (3.1), we have 






At 



- + 5y_{A25y^+C)lj\ = 

,-((ai - + 6y_{{a2 - A2)5y^+u)l^] 



+c, 






At 






Multipying equation this equation by summing for i = 1 to Ji — 1 

and j = 1 to J 2 — 1, noting that = ^i,o = ^i,J 2 ~ using the 

discrete Green formula, we have 



e-e- 

At 



-,e) + [^i4.+r,4.+D + {A2Sy,+e,Sy,+^^] = (/(t„,X,U") 



-f{tn,X, [/” ^),C) + (4,-(ai - Ai)4_ + u)”,^”) -h ((5y _((02 - A2)Sy^ + u)^, 



D + (c 



Hi 



^{x 

At 



,e) + {e^,e) = h + i2 + i3 + h, 



(3.2) 
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where c, ak, A^ {k = 1, 2), and are the mesh functions on such that 
c = {c(xi,i), c(xi, 2 ), ■ ■ ■ , c(S etc. As 



\u —u 



n— li |2 



J 1 J 2 

i,j=0 



n— 1 






At 



~Yhih 2 < M{\\u\\w^.°°(L°°))At^ , (3.3) 



by the assumption iii.), we have an estimate for the first term on the right 
hand side of (3.2) 

|/ii< - u^-^\\\\e\\ < ^3(iiw” - + iir-'iDiieii 

< M(||«||^.,<»(^^))(||e"|p + Iir-^IP + Af2), 

where K 3 is a positive constant. For the second term on the right hand side 
of (3.2), we have 

\h\= |((ai - Ai)"(5,,+u", 5,,,+r)l + I((a2 - A^rSy^+u^, Sy,+C)\ 

< e[|[^..+riP + H<5y.+r]p] + Mi[|[(ai - Ai)"4,+u"||2 

+ 11(02 - A2)”i5j^, + m”]P], 
where ^ is a positive constant. Since 

Ji — 1 , J 2 — 1 

|[(oi - Ai)"J,,+o"||2 < ^ ((oi - Ai)r+i,,)"/^i/i 2 , 

i=0,i = l 



from assumption iii.), we have 

|(oi - Ai)"^,_.| < -ur~^\ + K+i,,. - C/r+ll, 



1} 



< - <-'l + K+I., - <+it'l + IC7'I + \&} 

which, together with (3.3), yields 



||(oi - Ai)"4,+u”||^ < M(||o"||^i,=o, ||u||wc~(L~))(||r-1r + AC) 



Smiliarly, we can obtain 

||(02 - A2)"5,,+o"|P < mul\wc^,\\u\\wc^^L^)){\\C-^f + AC). 



Therefore, we have an estimate for the second term of the right hand side of 
(3.2), namely 

1 / 2 I < s[\\s^,+af+\\s,,+e\f]+mui\wc^M\wc^(L^)^^^^^^ 
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We now estimate the third term of the right hand side of (3.2). By 
assumption iii.), we have 



- iij)| = - bi{x^,j,UY )| 

- b2{x,j,ur-^)\ < i^3(K, - <-'l + le”-'!), 



which yields 



i/3i< M(iK-i|ki,oo)(iK - + iir-'iDirii 

< m(ik-i||h.i,oo IP + ir-'ip ^ 



If |Vm| S L°°{L°°), from assumptions i.)-iii.), there exists a constant K* 
such that 

b{x,u{x,t)) d b{x,u{x,t)) 

■ c(i) l + c(x) 

Similar to 0, we define the norm for a{x) € by 



Jl — 1, J 2 — 1 

'^llp “ max{|o;(a;)p, \x — Xij\ < K* At}hih2- 

ij = l 



It is also not difficult to obtain 

||e”|p < M,{\\u-\\Uht + ht) + 

Therefore, we have 

1 1 7 / 

|/4i< 2iiriP + 2^i(iKiiff^(^i + ^2) + ii^iii.p„_,,*„;P)At^). 

We now consider the estimate of left hand side of (3.2). By the triangle 
inequality and assumption ii.), we have 

/1+/2+/3+/4 > ^[(cr,r)-(cr-\r-')]+ao[i[ 4 .+riP+ii< 5 ,,+r]p]- 

where = ^”“^(i). To simplify this inequality, we need to find a relation 
between (c^”-p^-^) and (cf’^-M"-!). Set 

„ b^{x.,„U^-^)At b,{x,,„Ur-^)At 

Then Xij = (ij, y^), and Xi = Xi — a^jhi, = yj — Id^jh 2 - By assumptions 
i.) and ii.), we can choose At and hi, /12 such that 
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Hence Xij = € [xi-i,Xi+i] x [yj-i,yj^i] and the point Xij must be 

in one of the four domains I, II, III and IV as shown in Fig. I. 

Ci-i,yj + i) {Xi,yj + i) (Xi+i,yj+i) 4{xi,yj + i) 3(a:i+i . y^ + i) 



III 


II 

) (a^i+i, 


IV 


I 



(xi-i,yj-i) (xi,yj-i) (xi+i,yj-i) l{xi,yj) 2{xi+i,yj) 



Figure 1 Figure 2 

For simplicity, we have omitted the superscripts n and n — I for Assume 
that the point Xij is in domain II and the value of U'^~^{x) at is obtained 
by the piecewise bilinear interpolation of The bilinear interpolation 

function is 



Ti(C)(a:, y) = Nx{x, y)^^j + N^ix, y%+xj + Nz{x, j+i + ^4(0;, y)C»j+i, 



where ^ = u — U and Ni{x,y) denotes the interpolating function correspon- 
ding to (xi,yj) as shown in Fig. 2. Thus we have 

Tl (0 y j) ~ ^x^ + Ci,j + h2~hCHij [3ij5x^+ (^ 2,4 + 1 ^i,j)h\. 



Hence we see that 



i]=? ^ yj)hih.2 

■V Jl — 1, J2~l 



< " Cijiljhih2 + cl=i [|4.+6,iP 

+ |( 5 y,+^ij+ip]/ii/i 2 + M 2 AtY,‘^^_lj=i~^ \^i,j\‘^hih 2 - 

Similarly, for Xij G II, we can prove that the above inequality holds for Xi^^ € 
I, HI, and IV. Therefore, we have 



E-Ei < (ce-\e~^) 

+£[i[E.+e-'iP + ii<5,.+r-']p]At + M^iie-'ii 



Noting that 



^{Xi,yj) = u{x„yj) -U{xi,ijj) = u{x„yj) - Li{u){xi,ijj) + Li(^)(ii, y^-), 

where Li{u){x,y) denotes the bilinear interpolation function of {uij} on Clh, 
we can estimate u{xi, y^) — Li(u)(ii, y^) as follows. Let G II and assume 
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that II is divided into four equal rectangular elements f2i, fl2, H 3 , H 4 , and 
Xij G fli- Then we have 



\pi\ = \xi - < min(^,A:At), 

IP 2 I = \Vj -Xj \ < min('^,fcAt). 

Further, we expand u{xi+\,yj), u{xi+\, 
Vj+i), u{xi,yj+i), and u{x^,ijj) as the 
Taylor series at the point {xi , yj ) by 



+ 3(xi+i, yj + i) 



04 


Q 3 




0,2 



i{xi,Vj) 2{xi+i,yj) 



Figure 3. 



u{xi+i,yj) = + + ^{^){x^\yj), 

u{xi+i,yj+i) = u,j + + h2i^)ij + 

+^i ^2 ) + f & ’ y'" ) ’ 



u{xi,yj+i) = +/i2(|f)iy + 

u{xi,yj) = Wiy + Pi(|y)*y +P 2 (ff)*j + ,y') + piP2 



d^u 

dxdy 



{x',y') 



pI d^u 
2 dy'^ 



(x',y'), 



where x^>‘ = {1 - 6k)xi+i + 9kXi, y^'^ = {1 - 9k)yj+i + 9kyj, x' = {l-9)xi + 
9xi, y' = {1 — 9)ijj + 9yj, 0 < 9k < I {k = 1, 2, 3), and 0 < 6 * < 1. Thus we 
have 



|Li(m” - u” '^{x^,i/j)\ 

< M(||'u”“^||// 2 )[ft,i min(^, iFAt) + /12 min(^, ATAt)]. 

Similarly, for Xij G f2i, we can prove that this inequality holds for Xij G 
ri 2 , fla, fl 4 - Therefore we have the following lemma. 



Lemma 3.1 // [/" ^(a;) is obtained by the bilinear interpolation, we have 



(^n-l ^n-l^ _ ^ J£n-l\2k. ^ „ .r‘2^cn-l\ 



Ci,Mj^?h^h2 < Ei!~J{'^~^c.,jLlie-^)hih2 



+MJ2ij=Y^ - Li(m" '^){xi,yj)\‘^hih2 

<(cr-\r-')+£[ii4,+e”-'ip + ii<5y.+r-'iP]At+M2iir-i|pAt 

+M{\\u^~^\\H^)[hi mm{^,KAt) + h2 min(^, iGAt)]^. 
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(xi-i,yj+i)4 7{xi,yj + i) 3(a;i+i , i/^ + i ) 



If the value of C/”“^(x) at 
point Xij € II is obtained 
by the piecewise biquadra- 
tic interpolation of 
the biquadratic interpola- 
tion function is 





QCi,yj) 







Ci-l,yj-l)l 5 (xi,yj-i) 



S(xi^i,yj) 



2(xi+i,yj-i) 



Figure 4 . 

^2(0(x,y) = -h JV2(x,y)^i+ij-i + iV3(a:, j/)^i+iy+i 

+N4(x, + N5{x, y)^t,j-i + Nq{x, y)^i+i,j + Nr{x, y)^ij+i 

+Ns{x,y)^i-ij + Ng{x,y)^^j, 



where Ni{x,y) {i = 1, . . . , 9) denotes the interpolating function corresponding 
to node i as shown in Fig. 4. Then we have 

~h40ifjf3ij[6x,+ {^i,j-l — ^i-ij-i) — i5x, + (^ij + l — 
d~4(^i,jPi,j[^y,+ {^i-l,j ~ ~ ^y,+^i+l,j ~ Ci+lj'-l)]^2 

~ — l,f — l) ~ ~ Cij — l) 

Similar to the bilinear interpolation, we also obtain an inequality for the 
biquadratic interpolation through a tediously long computation, namely 

c^,]Ll{0{xi, yj)hih2 

<H--\e-^) + MM^-^\\^^t + e[\[5x,+e-^\\^+\\Sy,+e-T]^t 

< M(||'u”“^||//3)[h2 min(^, iFAt) -|- min(^, iFAt)]. 

Therefore, we obtain 



Lemma 3.2 z/C/" ^(a;) is obtained by the biquadratic interpolation, we have 

< (c?"-i,r"')+M3ir"'ipAt+£[|[4.+r-'ip 

-|-||5y,+^”“^]p]At -I- M{\\u^~'^\\H3)[h\ min(^, ATAt) -|- min(^, iFAt)]^. 
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From (3.4), the inequalities for li {i = 1, 2, 3, 4), and C/” ^(x) obtained from 
the bilinear interpolation, we have 

i[(ce,D - + ao[ii4,+r-i|p + ii<5,,+r-'iP]At 

<s[\\S,,+a\^+\\Sy,+^-\\^]At + e[\\S,,+e-^\^^ 

+M(||u||wi.=°(l°°), ||u||j;^cx=(^^i,oo))(||^”|p + + At^)At 

+ iMl(||M||j;,oo(^4))(/lf + hj) + |||:^||j;^2(t^_j_(^.f2)At^)At 

+M(||u”“^||fl-2)[/ii min(^, ATAt) + h2 min(^, iFAt)]^. 

Choosing e = ^ and summing up the inequality in Lemma 3.2 for n, we 
obtain 

irip + ao e:=o[ii«'"IP + ii<5..+rip]At 

< M{\\u\\w^,<=o{L<^), ||m||loo(// 4), |||yT||i2(f2))(X]m=0 HC^lPAt + hf + 

+Af2) + M{\\u^-^\\H2)[hi min(^^,iFAt) + /12 min(^^, ArAt)]^. 

Applying the Gronwell inequality, we obtain 

irii + {Ei=o{\\s^,+r\? + ii^,.+rip]At)5 < mu\\w^.^iL<^),\\u\\L^(H^), 

1 1 1 1 i 2 (, 2 ) ) ( [/ii min( ,KAt) + /12 min( ,KAt)] + At) . 

(3.5) 

Therefore, we have the theorems. 

Theorem 3.3 Let u G 4F^’°°(L°°) fl L°°{H'^) be the solution of equation 
(1.1) and € Lf{P), and let {Ufj} he the solution of equation (2.3) based 
on the bilinear interpolation. Then the error estimation (3.5) holds. 

Theorem 3.4 Under the assumptions of Theorem 3.3 about u and the so- 
lutions {Uf'j} of equation (2.3) based on the biquadratic interpolations, the 
error ^" = u" — C/" satisfies the following inequality 

iirii + (Eo^[|i<5.,+riP + ii^y.+rf ]At)i 

< M(||u||^^l, 00 (^ 00 ), ||m||^cx=(^4), |||:;|||^2([2))(^? + h\ + At). 
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Fractional Step Methods for Compressible 
Multicomponent Flow in Porous Media 

Yirang Yuan 



Abstract 

This paper discusses characteristic finite difference and finite ele- 
ment fractional step methods for three-dimensional multicomponent 
flow in porous media. Optimal order estimates in the L^-norm are 
derived for the errors in the approximate solution. 



KEYWORDS: compressible flow, characteristic finite difference and finite 
element, fractional steps, L^-error estimates 

1 Introduction 

In modern numerical simulation of prospecting and exploiting oil-gas resour- 
ces, the problems met are often three-dimensional, large-scale, large-scope 
and extralong-term ones. The node number is as large as tens of thousands 
or even hundreds of millions, which calls for the new technology of fractional 
steps to solve the problems. First, a kind of characteristic finite difference 
fractional step methods is developed. Thick and thin grids are used to form a 
complete set of techniques, such as the piecewise product threefold-quadratic 
interpolation, the calculus of variations, the multiplicative commutation rule 
of difference operators, and the decomposition of high order difference ope- 
rators. The prior estimates and techniques are adopted. Optimal order esti- 
mates in the L^-norm are derived to determine the errors in the approximate 
solution. Next, we establish a kind of characteristic finite element operator- 
splitting methods and use the operator-splitting, characteristic method, the 
calculus of variations, the energy method, negative norm estimates, and the 
theory of prior estimates and techniques. Optimal order estimates in the 
norm are derived for the errors in the approximate solution. These methods 
are successfully used in oil-gas resource estimation, enhanced oil recovery 
simulation, and seawater intrusion numerical simulation. 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 390-^^^ 2000. 

(c) Springer-Verlag Berlin Heidelberg 2000 
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The mathematical model for the three-dimensional compressible multi- 
component displacement problems is the nonlinear partial equations with 
initial-boundary conditions 1311011 El 

dj) 

(а) d{c)-^ + V -u = q{x,t), x € n, teJ=( 0,T], 

(б) u = — a(c)Vp, X € fl, t € J, 



and 



+ ba{c)^ + u ■ Vc„ - V • (DWca) = g{x,t,Ca), 
ot ot 

X € fl, t G J, a = 1, 2, . . . , ric — 1, 



( 1 . 2 ) 



where p{x, t) is the pressure function, Ca{x, t) is the concentration of ath com- 

Uc 

ponent, a = 1,2,..., ric, ric is the number of components, X) Ca(x, t) = 1, 

ric 

d{c) = d)(a;) ^aCa, d>(a;) is the porosity, is the “constant compressibi- 

a— 1 

lity” factor for the ath component, u is the Darcy velocity, a(c) = k{x)p{c)~^ , 
k{x) is the permeability of the rock, /i(c) is the viscosity of the fluid, ba{c) = 

Uc 

^(x)ca{za — X) i® til® diffusion coefficient. Let c(x,t) = 

1=1 

(ci(x,t),C 2 (x,t), . . . ,Cn^-i(x,t)) . Then the pressure function p(x,t) and 
the concentration functions c(x, t) are obtained. 

Here we assume that no flow occurs across the boundary: 



(a) M • 7 = 0, x G dfl, 

(b) {D\7ca — Cau) ■ j = 0, X G dfl, a = 1,2, . . . ,ric — I, 
where 7 is the outer normal to dll. In addition, the initial conditions 

(a) p{x,0) = po{x), xGfl, 

(b) Ca{x,0) = Cafi{x), X G 11, a = 1,2, ..., Uc ~ 1, 



(1.3) 



(1.4) 



must be given. 

For planar two-phase immiscible flow, Douglas, Ewing, Russell, and many 
others have published a series of fundamental papers on the characteristic 
finite difference and finite element methods pun 0. For compressible 
two-phase displacement problems, Douglas and others have published their 
papers P 0 0 P . On the basis of the preceding works this article makes a 
further study of the characteristic finite difference and finite element fractio- 
nal step methods and L^-error estimates for a three-dimensional compressi- 
ble multicomponent displacement problem. The three-dimensional problem 
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is decomposed to solve three one-dimensional problems continuously, thus 
greatly reducing the amount of computation work and making the actual 
computation of the project possible. This method is used successfully in oil- 
gas resources estimation pa mg, enhanced oil recovery simulation paim, 
and seawater intrusion numerical simulation pam^ 

Generally, this is a positive definite problem 



(a) 0 < a* < a(c) < a*, 0 < d* < d{c) < d*, 0 < Z?* < D{x) < D* 



da, . dd , . 



< K* 



(1.5) 



where a*, a*, d*, d*, Zl*, ZZ*, and K* are constants. Our assumptions on the 
regularity of the solution of (1.1)-(1.5) are given collectively by 

d‘^T) d‘^ c 

p, c„ G n ^ G 

a = 1,2, . . . ,7ic - 1. 



Assume that O = {[0,1]}^ and problem (1.1)-(1.5) is 0-periodic. Then 
the boundary condition (1.3) can be dropped niziini. 

In this paper M and e express a general positive constant and a general 
positive small constant, respectively, and they may have different meanings 
in different places. 



2 Finite Difference Fractional Steps 



Let n = {[0, 1]}3, h = 1/N, Xijh = {ih,jh,khf, Z" = nM = 






ijki 



( 2 . 1 ) 



(«) ^r+i/2.,fe = C'^+p.-fc)] /2, 

{b) o,'i+ii2,jk~ '^Gifc) + j'fe’ '^r+i j'fe)] /2j 

and <,j+i/ 2 ,k^ and be defined analogously. Let 



An ( pn+1 pn+1 \] 



_ pn+l 
ijk 



(2.2) 



An ( pn+1 pn+1 \] 

(c) - p^k) 

_ An p^+1 ^1 

— 1/2 ^ ij,k — l) \ ’ 

(d) V/,(A"VP”+i),,fe = fei(A-4,P"+i),,fc 
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For the fluid equation (1.1), the finite difference fractional step scheme is 
given by 



pn+l/3 pn 

(a) 

+4a(^"43^")zjfc + 1 < f < IV, 

pn+2/S _ pU+l/S 

(b) 1 < j < iV, 

^^^+l ^n+2/3 

(c) rf(gg-fc)^ l<k<N. 

(2.3) 



Compute the approximate Darcy velocity U = (t/i, U 2 ,Us)'^ as follows: 



= — 

^ l,ijk 



A 



P 






i+l,jk 



pn 

^ijk 



+ A^: 



i-l/2,jk~ 



pn pn 

^ ijk ^i—l^jk 



(2.4) 



with and being the corresponding average in the other direction. 

As the flow is essentially in the characteristic direction, we apply the 
modified method of characteristic to the first-order parts of (1.2), thus en- 
suring a high accuracy of the numerical results DlllEIHI. Let 4'(x,'u) = 
[4)^(a;) + and d/dr = ^{^d/dt + u ■ V}. Equation (1.2) can be 

rewritten in the form 



i’ 



dCg 

dr 



V- 



(PVCa) + bg{c) 



dp 

~dt 



g{x, t, Cq), X G fl, t G J. 



dc 

Approximate — ^ — = -;:g^(a;, t"’’’^) by a backward difference quotient in 
dr dr 



T-direction, 



— c” (a: — 4) ^(a:)u”+^(a:)At) 



dr Aty/l + 4>-2(x)|'u"+i(a;)|2 

For the system of concentration equations, the characteristic difference 
fractional step schemes are given by 



^n+1/3 

(a) 1.,, 

+S,JDS„CS)i,i + i,,(pS„C2)iii 

pn-^1 pn 

I < i < N, a = 1, 2, . . . , 7T.C — 1, 
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^n-\-2j3 ^ 71 + 1/3 

(b) = 5,,{D5^,{C2+^/^ - O)..,, 

^<j<N, a = 1,2, ... ,ric - 1, 

^n+1 ^n+2/3 

1 < fc < TV, a = 1, 2, . . . , ric — 1, 



(2.5) 



where we interpret C”(x) as the piecewise threefold-quadratic interpolation 

[15], = C-{X-,), and 1”, = ~ ^7jiU-,At. 

The initial approximation is given by 



^ijk ~ Po(^ijk), 

^<i,j,k< N, 



^a,ijk ~ Ca,0,ijk, 



a = 1,2, . . . ,ric - 1. 



( 2 . 6 ) 



The algorithm for a time step is as follows: Assume the approximate 
solution C2,ijki^ ~ 1, 2, . . . , Uc — 1)} at time t” is known. First, from 

scheme (2.3a), the method of speed up is used to get the solution of transition 
sheaf along the xi direction. From (2.3b), we obtain and 

from (2.3c), Next, from scheme (2.5a), by using the method of speed 

up, we get the solution of transition sheaf along the Xi direction. 

From (2.5b), we obtain and from (2.5c), So a complete 

time step can be taken. Finally, because of the positive definite condition, 
only one solution of this problem can be obtained. 



3 Convergence Analysis of Finite Difference 

Let TT = p — P and = Ca — Ca, where p and Ca{a = 1,2, ... ,ric — 1) are 
the exact solutions of this problem and P and Ca{a = 1,2, ... ,ric — 1) are 
the difference solutions. 

First, consider the fluid equation for equation (2.3a), by using (2.3b) and 
(2.3c), we get the equivalent form 

pn+1 pn 

-( At)2 {4^ (d-HC”)4, (A"5, J)) + (d-i(C") 

+ S,M’^S,Ad-HC^)b,M’^S,MdtPrjk 

+ (At)34^(A"<5,,(d-i(C")4,(A”4.(d-'(C") 

■ • ■)^Jk, l<t,j,k< N, 



(3.1) 
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where - PPj^}/At. By (1.1) {t = and (3.1), we have 

the pressure error equations 



7r”+^ — tt” 

d{cr,k) - V,.(^"V^7T"+i),,fc = -(At)2{5,^(A"4, 

Fdx2(A'^dx2{d ^ {C'^)5x2{A'^Sx2)))}dt'!Tfj^. + {At)^Sxi(A'^Sxi 
{d-^{C^)Sx2{A'^Sx2{d-\C^)6x2{A^Sx2dt7T^) . . .).,fc 

+(At)2 {4^ {A^6x, {d-\C^)6x2 (A^Sx 2))) + {A^dx, {d~\C^) 
S^M^Sx^))) + S^2{A^6x2{d-^{Cn5^2iA^6x,)))}dtP^jk 



-(At)34^(A"<5,,(d-i(C")4,(A"4.(d-'(C") 

;; f Anx a „n\ , „n+l 



here = — (7 t"+ 1-^") and |a”+i| < m{ 



i,j,k < N; 




f 


d'^p 




dp 


i 


dC 


ioo(ioo)’ 


Ik 



L°°(iy4,oo) 



lbllL“(iv4.cx,), ||ca|bcx,(^3,=o) (a = 1,2, ... ,nc - l)|(/i^ + At). Suppose that 
the space and time steps satisfy 



At = 0{h^). 



(3.3) 



By testing (3.2) against <5t7r”-fc = dt7r”fcAt = - 7r”-j, and summing by 

parts, we have 



< d(C")di7r",dt7T" > At+^{< V^7t"+i > 

- < ^”V?,7r", V/,7 t” >} < M{/i 4 + (At)2}At + £|dt7r”|^At 
-(At)3{ < SxM"Sxdd-\C^)SxAA^6x2dtTTn)),dt7T^ > 

+ < Sxi{A'^Sxi{d-^{C”^)Sx^{A^6x2dt7T^))),dtTT”^ > + < 

(d-i(C")5s3(A"43dt7r"))),dt7r" > } + (At)4 < S^,(A"Sx, 
(d~'^{C'^)Sx2{A'^Sx2{d~'^{C^)6x3iA'^Sx3dtTT^) . . .)ijk,dtTT’^ > . 



By the multiplicative commutation rule of difference operators and the de- 
composition of high order difference operators, we obtain 



|dt7T"|2At + i{< > - < A^VhTT^, VhTT^ >} 

< M{|7t”+ 1|2-H |7T"|2-H/l4-H(At)2}(At). 



Next, consider the concentration equations. For equations (2.5a), (2.5b) 
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and (2.5c), we get the equivalent from 

^n+1 An pn-\-l pn 

- ^h{DVhC^+%k = -&a(C"fc) 

- {At)^S,,{DS,,{<p-^S^,{DS,J)) 

+^xi{D5xi{^ ^5 x^{D6x 3))) + ^^x3{D5x^)))^dtC'i!^.^jf. 

+{Atf5x, {D5x, {<^-^5 x^{D5x3 (^"^<5^3 {DSx^dtC^) ■ ■ O^fc, 
^Ai,j,k<N, a = 1, 2, . . . , ric — 1. 



Similarly, we can obtain 

\dtai^t+ < V/.C+' > - < . , 

< M{\C\l + |r+i|? + \Vh7T% + h^ + (At)2}, ^ ^ 

where |^|f = |?lo + iV^CIo- 

For (3.5), summing on 0 < n < F and for (3.7), summing on 0 < n < L, 
1 < a < ric — 1, we have 



E \d,7T-\lAt+\n^+^\l 

"=° L-l L (3-8) 

<^E \dtC\l^t + M{h^ + {Atf+ E k”+'l?At}, 

n—0 n—1 



and 

L 



^ IkrioAt + < M{h^ + {Atf + J2 [ir+'l? + k"l?] At}. (3.9) 



n—O 



Combining (3.8) and (3.9) and applying the discrete Gronwall inequality, we 
have 



L 

E 

n—O 



[\dtnX + Mtrio] At + k^+'l? + < M{h^ + {Atf}. (3.10) 



Theorem I Suppose that the exact solution of problems (1.1)-(1.5) sa- 
tisfies condition: p, Ca{a = 1, 2, . . . , ric — 1) € 

dp/dt, dca/dt{a = 1,2, ... ,Uc — 1) € d'^pjdt^, d‘^Ca/dT‘^{a = 

1, 2, . . . , Uc — 1) G L°°{L°°). Adopt the characteristic finite difference frac- 
tional steps schemes (2.3) and (2.5). Let the discretization parameter satisfy 
relation (3.3). The error estimates hold 

ric — 1 

\\p ~ P\\L°°{J-,m) + E \\<^a — CaWloa^j.f^l) + \\dt{p — P)\\l2^J.l2f 

<A—1 

ric—1 

+ \\dt{Ca — Ca)\\laa(^j.il'f < M*{h'^ + (At)}, 



(3.11) 
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N 



where ||/||loo(j.x) = sup ||/”||x, Mmj-x) = sup { X) 

nAt<T nAt<T n=0 



M* — M*|||p||vi/l,oo(^^l,oo), ||p||loo(^4,cx=), 
\\Ca\\w^.°={W^'°°), 



dp 




d'^p 




L“(M/4,oo)’ 


df^ 









d'^Ca 


(lyl.oo), |Cq 1^00(^4,00), 


dt 


Loo(W4.oo)’ 


dr'^ 






L~(L~) 

; (a = 



4 Finite Element Operator-Splitting 

The finite element method for problem (1.1)-(1.4) based on the weak form 
is given by 

(a\ + (a(c)Vp, Vu) = (g(c),u), 

( Oc \ / Ov \ 

V + {u-VCa,z) + {DVCa,Vz) + (ba(c)-^,zj 

= (g(ca),z), zGH^{fl), tGJ, a = 1,2, ... ,ric - 1. 

Assume that d = d\{xi)d 2 {x 2 )dz{x^) and d) = d)i(a;i)<I) 2 (a; 2 )d) 3 (a; 3 ). We 
discuss a finite element operator-splitting method approximation of the fluid 
equation (4.1a). We equidistantly subdivide the region fl. The coding of 
nodes: {a;i,a|0 < a < {x2,/3\0 < P < and {a^s.ilO < 7 < -^ 2 ^ 3 } is 

used. The global coding of three-dimensional mesh region z (i = 1, 2, . . . , N), 
N = {Nxi + 1 ){Nx2 + 1)(A^X3 + 1) is obtained. The tensor product index of 
node z is (a(z), /3(z), 7 (z)), where o;(z) is the number of the xi-axis, /3(z) is 
the number of the X 2 -axis, and 7 (z) is the number of the cc 3 -axis. The tensor 
product basis can be rewritten as products of one-dimensional basis functions 
in the manner 



Ni{xi,X2,X3) = (Pa{i){xi)lpf3(i){x2)uj^(i)ix3) 

= (Pa{xi)lp/3{x2)ujj{x3), l<i<N. 



(4.2) 



If Nh =ip®'tp®uj is a finite element space, let 



w / r A. -\ 

" = 57' S757<'^^>-a7 



qS 



w 



dx2dx3 



e L- 
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Note that C W [22,23]. The approximation properties are given by the 
inequalities 



inf 

xeNh 






m=0 



E 

i,j,k = 0,1 
i + j + k = m 



d^{u-x) 

dx\dx^2^x^ 




< Mh';+^\\u\\k+i, 



(4.3) 



where hp = max{N~^^ , , N~^} is the subdivision step. 

We discuss a characteristic finite element operator-splitting method ap- 
proximation of the concentration equations (4.1b); similarly, we equidistantly 
subdivide the region ft and use the coding of nodes: {xi^vlO < A < 

{3^2, m|0 < fJ. < N^^}, and {x3,,^|0 < x < The global coding j 

{j = 1,2,...,M), M = (Mj,^ -I- l)(Ma ,2 -I- l)(Ma ,3 -I- 1) is obtained. The 
tensor product index of node j is (A(j), /i(j), x(j)) • The tensor product ba- 
sis can be rewritten as products of one-dimensional basis functions in the 
manner 

Mj{xi,X 2,X3) = <^>x(j){xi)^ f,(^j){x2)^^(j){x3) 

= ^X{xi)'i’fj.{x2)^xi^3), 1 < j < Af. 

Let the finite element space = 4) 0 4' (g> fl. Note that C W. 

The approximation properties are given by the inequalities 



3 




i,j,k = 0,1 
i + j + k = m 



d^{u- ip) 
dx\dx{dx\ 






(4.5) 



where he = vn&x{M~^ , M~^ , M~^} is the subdivision step. 

The characteristic finite element two-level operator-splitting scheme of 
problem (4.1): When t = t", if G Nh x are known, we find 

the finite element solution € NhX , t = First, the 

finite element scheme of the fluid equation (4.1a): 



{ddtpl,vh) + {a{cl)VP;^,Vvh) + XpAt{dVdtP^,Vvh) 



+{XpAt)'^ X) 



( 



B-^dtPi: dW 



^ d^dfPJl d^Vh 
8x18x28x3 ’ 8x18x28x3 



8xi8xj ’ dxidxj 



) + {XpAt)^ 



(4.6) 



) = (q(x,t",CJ)),Vh), yvh G Nh, 



K = -a{C'^)WP;:, 



(4.7) 
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where dtPJ^ = - P^")/At, 



S ( 






( d^d^py 


d'^Vh \ 


( d^dtPy d^Vh \ 


V dxidxj 


’ dxidxj J 


V dxidx2^ dxidx2/ 



( d^d,Py 


d'^Vh ^ 


\ 1 


d'^Vh \ 


V dx2dx3 ’ 


dx2dx3^ 


' Vdxadxi 


’ dxsdxi) 



and Xp is a chosen constant. 

As the flow is essentially in the characteristic direction, we apply the 
modifled method of characteristic to the first-order parts of (4.1b), thus en- 
suring a high accuracy of the numerical results [|] Q O Oj- Let = 

[4)^(x) -I- and d/dr = ij}~^ {^d / dt + u ■ V}. We write (4.1b) in the 

form 

+ (DVCa,Vz) + (ba{c)^,zj = {g{ca, z)) , 
z€P[^{fl), t G J, Of = 1, 2, . . . , ric — 1. 
dc 

Approximate — ^ — = -p^ix. t"^^) by a backward difference quotient in the 
dr dr 

. dcl+^ cl+\x)-cl{x-u^+^/Xt/^{x)) 

T- irection, At(l + |2)i/2 

For the concentration equations (4.8), the characteristic finite element 

operator-splitting procedure is 






^n+1 /^7 



At 

+(AcAt)^ 



+ 



3 

E 

d^dtCft . 



^,zh)+ (PVC2,^,Vz/,) + A,At($VdtC"^,Vz/,) 

d^zh 



(4>' 



dxidxj ’ dxidx 



^ + {XcAty 

dxidx2dxs’ dxidx2dxs) ^ (^<^^^h)dtPh , Zh) 

= {9iC^,h)^Zh), yzhGMh, a = 1,2, ... ,ric - 1, 



(4.9) 



d^Zh 



where h ~ x = x — U'l^At/^{x), and Ac is a chosen constant. 

For the fluid equation (4.6), if = E ^a^s-y V^«V’/ 3 ^ 7 > then (4.6) can 

Q!:/3,7 

be written in the form 

E (Q37 - ^a/37) ^7) 

+ XpAt (O 7 -S/37){(Va®#®W^,<^a®V'/3®‘^7) 

+ {dipa 0 '0/3 ® ^^7) + {^Fa ® 0/3 ® w!), ifa C> y/3 C> toly)} 

+ {XpAty J2 - C/ 37 ) {( Va ® ® Ca ® 0(3 ® ^^ 7 ) 



(4.10) 
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+ {dipa ® ® iPa ® ® Uj'^) + {d(p'a P’a 0 V '/3 ® <^7) } 

+ (ApAt )3 ^ - C2/37)( Va ® ® W7, </ 5 'a 0 V'^J ® Ul!y) = AtF^ . 

oc,^,'y 




^xi — 



^X2 — 



^xs — 




di^'ay^^dxiy 

dsu'^y^^dx^y 



(C'aJi + ApAtAj-j) 0 {Cx2 + 0 (C'a;3 + XpAtA^;^) 

(e”+i -r) = At-F”, 



(4.11) 



where 



J^o/ 37 = -(a(C^)VP^", V(<p„ 0 # ® W7)) + (d{Cj:),^c. 0 # 0 u;7). (4.12) 



We point out that (4.11) can be solved by an alternating-direction. 

For the concentration equation (4.9), if C2~\^ = X) Ca 

a.m.x 

can be written in the form 

X (C^^MX - ® ® <I>A ® 4/p 0 n^) 

+AeAt X (C.Amx - C,Amx){(^^a ® ® ^x> ^A ® 4-p ® fl^) 

-h( 4 > 4 >A 0 4 Ap 0 flx> ® 4 'p ® fix) + (^^A 0 4 'm 0 fl'x, 4 >a ® 4 'm ® fl'x) } 

+ (A,At)2 X (C.Vx-QA;.x){(‘J’‘J’A®'l^;®^X:‘J’A®n®^x) 

A.Ai.X 

-h( 4 > 4 >A 0 4 Ap 0 fl'x, 4 >a ® 4 Ap 0 fl'x) + ( 4 > 4 >a 0 4 -p 0 fl'^, 

4>'a ® 4/p 0 fi;) } + (XAt)3 ^X (Ca'^x - ® K ® ^'x. 

0 4';® fi;) = AtGS, a 1, 2, . . . , n, - 1, 

(4.13) 

where 



£>xi = (^ 4>i4>Ai(a;i)4>A3(a;i)(ixi), = (^ 4>i4>'x^4>;^dxi) , 

^^2 = (^ ^24'p,4'p3dx2), B^, = 4>24''p3 4';^dx2), 

Dx 3 = (^J ^3^xi^X2dxsy B^^ = (^ ^sn'^^n'^^dxsy 

{Dxi + XcAtBx^) (Si {Dx 2 + XcAtBx^) O (H^a + XcAtBx^) 
(C+'-C) = AtGS, a=l,2,...,n,-l, 



(4.14) 
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and 



(4.15) 

Similarly, we point out that (4.13) can be solved by an alternating-direction. 

Theorem II Suppose that the exact solution of problem (1.1)-(1.5) is 
smooth. Adopt the characteristic finite element operator-splitting scheme 
(4.6), (4.7), and (4.9). Suppose that k > 1, I > 1, and the spatial and time 
discretizations satisfy the relations At = 0{hp) = 0{hf), = o{hV^), 

and hlf'^ = o{hp^^). Then the error estimate holds 



Tlc — l 

\\p ~ Ph\\L^iJ-,L^{a)} + l|Ca — C'a,/i||i^(j;L 2 (fj)) 

(A — 1 



ric — 1 



+ \\dt{p — Ph)\\L2(J-,L‘^{Q)) W^a — Ca,h\\L2(J-,H^ 



< M*{At + h’f+^ + hi+^}, 



Q! = l 



(O)) 



(4.16) 



N 



where \\g\\L^(^j.x) = sup ||g"||, \\g\\i^(^j.^x) = sup ( ^ 

nAt<T NAt<T n=0 

the constant M* depends on p, c, and its derivatives. 



5 Applications 

The numerical method has already been used in the numerical simulation for 
evolutionary history CEII3, their mathematical model: 

V- ( — Vp) = (a(l-4>)-f /34>)^ -a(l-4>)-^® 

\ ^ / eye • 



(a) 



-f (a(l - 4>) -f /34>)^, X = {xi,X2,xf)'^ t G J, 

dT 

(b) V ■ [KsXT] - c^p^V ■ (VT) + Q = CsPs^, x G fl, t G J, 

( 0 ) 



dt 



(5.1) 






The numerical method has also been used in the numerical simulation for 
enhanced oil recovery simulation [El EH. Their mathematical model is 



dt 



lip ILp 

V • f ^ ^ ~ ^ ■ f ^ '. ‘^SjDj{Uj)XCjj') = Qi{ci), 



i=i i=i 

X G fl, t G J, i = 1 , 2 , . . . ,Uc. 



(5.2) 
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Moreover, this method has been used in the numerical simulation for seawater 
intrusion [14,15]. Their mathematical model is 



(a) 

(b) 



OH 

V • {aVH) = Ss ~^ — q, x t G J, 

Or OH 

V • -V -Vc= 4-— + 4'S'sC— 

at at 

—q{c—c*), xGfl, t G J. 



(5.3) 
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A Model and Its Solution Method for a 
Generalized Unsteady Seepage Flow Problem 

Guoyou Zhang Tigui Fan Zhongsheng Zhao 

Dequan Yang 



Abstract 

A model and its solution method for a generalized unsteady seepage 
flow problem are given. Using this method, many generalized unsteady 
seepage flow control equations are described. Using the solutions to 
these equations, the pressure distribution of vertical crevice and level 
wells is obtained. We can analyze the unsteady testing material of oil 
wells. 



KEYWORDS: generalized unsteady seepage flow, control equations, oil well 

1 Introduction 

The study of seepage flow is important in engineering design. The new tech- 
nology of oil held development is hard to be treated. They are related to 
the study of seepage flow. Some models and their solution methods have 
been developed for seepage flow. Using Green’s functions and minor images, 
Ozcan and Raghavan gave some important results; also, see P|. A mo- 
del and its solution method for a generalized advective seepage flow problem 
is given. Using this method, many generalized unsteady seepage flow con- 
trol equations are described. The pressure distribution of the seepage flow 
is obtained. Three formulas are given with different a and 13. The formulas 
become approximately linear or steady state with the dimensionless time in- 
creasing. The source solutions are obtained when a — /3-1-2 > 0. The pressure 
distribution of vertical crevice and level wells is given from the solutions. We 
can analyze the unsteady testing material of oil wells with these results. All 
these results show that the method is effective. 
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2 A Mathematical Model 

If a and /3 are control parameters, we can change many seepage flow problems 
into generalized control equations for the unsteady Darcy flow of a weak 
compressible fluid in a porous medium: 



1 a _ ) 



dro 



dro J 



dto’ 



0<a<3, 0</3<5, 



( 2 . 1 ) 



where a = 0,^ + 7 - 1, /3 = 0^ + 7 - 1, p_D = kohAp/{cpQiiob), Ap = p-pi for 
injection wells, Ap = Pi~ p for production wells, tu = ctkot / {(f>p,oCtr'^) , and 
ro = r/r^j. Here h is the medium thickness. <f> the porosity, po the viscosity, 
Ct the compressible coefficient, the well radius, Cp and Ct unit conversion 
coefficients, pi the initial pressure, 0^ the interval changing coefficient, 9k the 
osmosis changing coefficient, 9p the fluid viscosity changing coefficient, and 
7 the fractal dimension. Form (2.1) owns an extensive meaning. It can be 
changed to various generalized unsteady seepage flow control equations with 
different a and jd. If Q is out of a vertical well, the unsteady control equation 
can be obtained for an even equal thickness 



d'^p 



'D 



drl 



J^d^ _ dpD 

td dro dto '' 



with the initial condition 



PD{rD,0) = 0 , 



the inner condition of the well 



0 dpo\ 

D 



dro J 



= - 1 , 



( 2 . 2 ) 



(2.3) 



(2.4) 



r_D = l 



the out boundary condition 



Pd{(x>Ad) = 0 , 



(2.5) 



and the pressure at the well wall 

Psd{s) = PsoilAo)- 



( 2 . 6 ) 



Eqs. (2.2)-(2.5) are unsteady seepage flow formulas of an even equal thickness 
vertical production well. 
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3 Solution 



Making the Laplace transform for form (2.2), if s is the Laplace transform 
variable and 



we can obtain 



p{rD,s) = j PD{rD,tD)e 



d^PD , (3 dpD a-0~ 

3 ^ Pd, 

ori, ro otd 



whose complete solution is 



pD{rD,s) = < 



,1-/3 



D 



CiF Zi\ r 



,(«-/3+2) 



D 



+ C2K^ \^i^r 

01 — /3 -t“ 2 > 0, 



,(«-/3+2) 



D 



a-/3+2 = 0 , 



,1-/3 

D 



CiF Z 2 \ r) 



(X — /3 -l- 2 0 , 



where /^(*) and K^{-) are the 7 step Bessel functions of second type: 

1-/3 ,, Vs ^ 2v^ 

' 0-/3 + 2’ 0-/3 + 2’ P-a-2 

We can also obtain the pressure distribution in the medium from (2.4) and 
(2.5) 



,(l+/3) , 






pD{rD,tD) = \ 



SVSI r-ijZl) 

s( /3 + 7 (r Wf^- i)^ 

S,JsIr-l{Z2) 



ot — /? -h 2 > 0 , 

a-/3 + 2 = 0, 

0-/3 + 2 < 0. 



When the dimensionless time become longer, the gradual results of form (3.1) 
are 



Psoitn) 



1 



if a — P + 2 < 0, 
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and if a — /? + 2 > 1 , 



Psoito) 



!i!' 

a + 1 
1 1 

. 7^1- 



/?> 1 , 

{LMd + ‘ILn{a + 1 ) + 7 ) , /3 = 1 , 



-{a-f3 + 2f^ 



/ 9 < 1 , 



We can see the change trend of the pressure and pressure derivative from this 
equation. 

There is a real space solution under the point source inner boundary 
condition for the first case of the complete solution. Other solution conditions 
do not change. The point source inner boundary condition is 



rpdpD 

dro 



= - 1 . 



I’D— >0 



When making the Laplace transform, the pressure distribution is 



PD{rD,s) = 



1 






s-/ir(l — 7 ) \a — P + 2 

which can be inverted as follows: 



1-7 



,(1-/3). 

D 



KMZ,\r 



,(a-/3+2) 

D 



PoirDAo) = 



1 

T(f-7Ka-> + 2)1-27 



tn 



^ exp I — 



«-/3+2 
' D 

{a — fl + 2 ) 2 t 



dr. 



Beier’s 0 point source solution can be obtained from this equation. 

Using numerical methods, the dimensionless well wall pressure and pres- 
sure derivative can be obtained from (3.1) with different a and /3. The 
unsteady testing material is analyzed by using these results. 



4 Conclusions 

( 1 ) The generalized unsteady seepage flow model which we obtained can be 
changed into unsteady seepage flow control equations. 

(2) There are three forms for the pressure distribution of seepage flow which 
we obtained with different a and /3. The analysis shows that these three forms 
are steady state or approximately linear with longer dimensionless time. 

(3) When a — /I + 2 > 0, the solutions of the point source in real space are 
obtained by using numerical methods. The pressure distribution of vertical 
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crevice and level wells is obtained from these solutions. The unsteady testing 
material of the oil well pressure is analyzed using these results. 
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Domain Decomposition Preconditioners 
for Non-selfconjugate Second Order 
Elliptic Problems 



Huaiyu Zhang 



JiACHANG Sun 



Abstract 



A non-symmetric interface Schur complement arises from non- selfcon- 
jugate second order elliptic problems with domain decomposition me- 
thods. The usual numerical methods for solving it are GMRES, OR- 
THOMIN, and BICGSTAB, but they take a large amount of compu- 
ter time and memory. The authors find in this paper that the non- 
symmetric Schur complement can in fact be changed into a symmetric 
one by scaling. Then an efficient preconditioner can be provided by 
which the preconditioned system can be solved iteratively by a mo- 
dified PCG method. When the problem is imposed on a rectangular 
region, the condition number is estimated and is nearly one. Numeri- 
cal experiments are also presented. Non-selfconjugate problems arise 
in mathematical modeling and numerical simulation of fluid flows and 
transport in porous media. 



KEYWORDS: non-selfconjugate, elliptic equation, domain decomposition, 
Schur complement, preconditioner 

1 Introduction 

Consider the non-selfconjugate second order elliptic equation 



Au + uj-\/u=f in O, 



( 1 . 1 ) 



where O is an L-shaped region in R^; i.e., O = U with (see Fig. 1) 
= (0,ai) X (0,6i), = (01,02) X ( 0 , 62 ). 



The Dirichlet boundary condition is imposed 



u = 0 on (9n. 



( 1 . 2 ) 



Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 409-^^^ 2000. 
(c) Springer-Verlag Berlin Heidelberg 2000 
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We assume that we can construct the rectangular grids on the rectangles 
t = 1,2. With the common step h, we have 

= {{nh, mh) : n = 0, . . . , Ni,m = 0, . . . , Mi,Nih = oi, M\h = 6i}, 

= {(nh, mh) :n = Ni,...,Ni + N 2 ,m = 0,..., M 2 , 

(iVi + N2)h = a2, M2h = 621 - 

Let denote Clh = U We decompose Clh into two parts: = 







p 


• J 

3 

tc 



Figure 1: An L-shaped region. 
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We now partition the vector of unknown coefficients in the same way as the 
matrix: u = ■ The linear system can then be written as 



( Af} 



0 \ 



Af] 0 



/ / 0 \ / .w \ 



41 I J 



0 / 44^41 



,( 2 ) 



= /, 



(1.5) 



V 0 0 5 ( 1 ) + 5 ( 2 ) ) V Ub ) 



where 5(*) = ^4 ~ i® called the Schur completement and 

each can be calculated independently. Perform a forward solver to obtain 



\ / \ 



f I 0 Ajj Ajg \ I U.J 



0 / A\r^A\g 



(2)-l A2) 

II ^IB 

V 0 0 5(1) + 5(2) J 



,( 2 ) 



\ Ub ) 



\-A 



( 1 ) 

BI 



0 

I 

-A 



0 \ / 



(1)-1 



( 2 ) 

BI 



I ) 



0 \ ( \ 



( 1 . 6 ) 



V 



A 



(2)-l 

II 

0 



l) 



fp 

V fB J 



Next, we must solve the reduced Schur complement problem 

(Sf> + = /a - Amp'r-'P’ - Al.AfrP' ^ 9- 



( 1 . 7 ) 



The solution of this equation gives us the values along the interior boundary 
and the problem then splits into two independent subproblems. We can thus 
back solve in parallel for the interior unknowns in each substructure: 



(b _ fd 



II 



(/!” - 



( 1 . 8 ) 



This procedure can be generalized to the case of N subdomains. The 
Schur complement arising from non-selfconjugate second order elliptic equa- 
tions is non-symmetric. Its condition number grows like 0{l/h). Since it is 
denser than the original stiffness matrix, in general, the linear system (1.7) is 
solved iteratively. The usual iterative methods for non-symmetric problems 
are GMRES, ORTHOMIN, and BICGSTAB. 

Several papers of Ghan, et al. have been devoted to the construction 
of preconditioners for this kind of non-symmetric Schur complement; see 
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0OJ. These preconditioners are referred to as interface solvers or interface 
preconditioners. The authors in this paper show that the non-symmetric 
Schur complement can be changed into a symmetric one only through the 
method of diagonal scaling. An efficient preconditioner is also provided, by 
which the preconditioned system can then be solved iteratively by a modified 
PCG method instead of those mentioned above. 



2 Diagonal Scaling and Preconditioners 



Suppose that the standard five-point central difference scheme is applied, 
which has the form 

-1 -I- ^L02h 

—l—^LUih 4 —l+^u!ih 
— 1 — ^UJ2h 

Let Ti, i = 1,2, be an Mi x Mi matrix having the form 

/ 4 -1 -I- luj2h \ 



y[j] _ 



— 1 — ^C02h 



—1 -I- ^UJ2h 



V 



The Schur complement then becomes 



— 1 — ^UJ2h 



5* = r[2] _ (1- 



Ni 






( 2 . 1 ) 



where Bn has the recursive relations 

fiW = (tW - (1 - , Bf = i = 1,2. 



( 2 . 2 ) 



Let D be a uniformly bounded diagonal matrix with its elements given by 



di — 



1 1 + ^iU2h 
1 - \w2h 
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Let S = DSD~^. It can be easily seen that S becomes a symmetric matrix 
so that the CG method can be used. Numerical experiments show that the 
times of iteration by the CG method reduce largely. 

Let 

(2.3) 

where 



( 2 — 1 + 



\ 



K — 



— 1 — \uJ2h 



2 



— 1 + ^UJ2h 



\ 



— 1 — ^u>2h 2 / 



Also, let C = DCD ^ . Now, instead of solving (1.7), we can solve any of the 
two preconditioned systems by the PCG method 

C~^S^vb = DC~^Sg, ub = D~^vb, (2.4a) 



or 

S^C-^VB = DSg, UB = C-^D-^vb. (2.4&) 

For the PCG method, see Chapter 9 in j^. Or, equivalently, we solve any of 
the two by a modified PCG method 

DC-^S^ub = DC~^Sg, (2.5a) 

or 

DS^C-'^vb = DSg, UB = C~‘^vb. (2.5&) 

MODIFIED PCG ALGORITHM: 

1. Compute rg := g — Su^, zq := C~‘^Sro, and po ■= zq 



2. 


For j = 


= 0,1,..., until convergence 


3. 


aj := 


^ {Drj,Zj)/{D‘^PpS‘^Pj) 


4. 


H-i 

Ub 


:= u% + ajPj 


5. 


Tj + l 


:= Tj - ajSpj 


6. 


Zj + l 


:= C-‘^Sr,+i 


7. 


f3,:= 


{Drj+i,Zj+i)/{Drj,Zj) 



8 . pj+i := Zj+i + PjPj 

9. EndDo 

In the above algorithm, is a tri-diagnoal matrix, so it is easy for C~^ 
to do the matrix-vector operations. 
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3 The Condition Number 



Define the S'^-inner product by 



(x, J/)s 2 = y) = {Sx, Sy) = (x, S'^y). (3.1) 



For a linear operator L, which is self-adjoint with respect to the S'^-inner pro- 
duct, we use the Rayleigh quotient characterization of the extreme weighted 
eigenvalues 



min 



tr\ • .g2 {Lx,x)g2 

{x,x)g2 ^¥=0 {x,x)g2 



(3.2) 



Then the condition number of L is given by 



k{L) — A£ax(-^)/^£in(-^')- 



(3.3) 



Lemma 3.1 It holds that niC ^ 5 ^) = max (^ cc, x) ^ 

(C'^Xjx) ^^0 {^C‘^x,x) 

When the problem is imposed on a rectangular region, i.e., 61=62) we show 
that the condition number of the preconditioned Schur complement is nearly 
one. From the numerical experiments, it seems still true for the L-shaped 
regions. 

Theorem 3.1 Assume that a = min(ai,a 2 — ai) > 1 and irh < 1/3. Then 
the condition number of the preconditioned Schur complement is hounded by 

k(C-^S^) < (l + e-2"“)2. 

Proof: When a rectangular region is imposed, let have the recursive 

relation 

B^ = {T-{l-^ivlh^)B„_i)-\ Bi = T~\ 
where T = = T^^l. Then the Schur complement S has the form 

S = T-(l-^ujlh^){BN,+BN2). 

Under this case, S'^ and have the same eigenvectors 

52 = {D Q)A|(D Q)-\ As = Diag{A,}, 

= {D Q)AUD Q)-\ Ac = Diag{Md, 
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where Xi and fit are the eigenvalues of S and C, respectively. It yields that 
k(C-^S^) = max(^)Vmin(^)2. 

Pj Pj 

Let 

aj = ch~^ I — ^ 1 > 0, t , = 4 — 2\ 1 — cos jirh. 

\2^i-iy,hd * “ 



We see from the corresponding recursive relations of eigenvalues that 



A,= 




ch{Ni + l)aj 
sh{Ni + l)aj 



Pj = 2Wl — jLo'lh'^shaj. 



ch{N2 + l)aj 



Therefore, we have 

Xj 1 , ch{Ni + l)aj ch{N 2 + i)ctj , 

Pj 2 sh{Ni + l)aj sh{N 2 + l)aj 

g-2(Ari + l)a,- ^-2{N2 + l)aj 

^ ^ + 1 _ g-2(Wi + l)a^ + I _ g_2(JV2 + l)aj 



When 7 t/i < 1/3, it follows that 



e““^< 



(- 



< 2 — cos irh — \/(2 — cos — 1 < 1 — 2j:h. 



-.y-1 



Therefore, we obtain 



This completes the proof. 



A, , 

^ < 1 + e” 

Pj 



4 Numerical Experiments 

Let 



K = 



( 2 -1 
-1 2 

V 



-1 
-1 2 / 
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and 

C^,=K+ Ci = D{K^, + \kX)D-\ 

Ci = K + -K^ + Cj = + -^K^2 + 

The following numerical experiments are done using Matlab, where the pa- 
rameter uj is given in the table and h=l/Ni. 



UJ 


k{S) 






K(cr"s") 


(5,5) 


19.87208 


1.50997 


1.20197 1.20464 


1.00002 


(10,5) 


16.04520 


2.26958 


1.80664 1.12779 


1.000001 


(5,10) 


17.97331 


2.28940 


1.12501 1.82647 


1.000001 




Table 1. A'"! = A ^2 


= 40, Ml = M 2 = 25. 




CJ 


k{S) 








(30,30) 


7.04838 


2.06521 


1.81486 2.06983 


1.000000 


(40,20) 


5.64662 


21.28800 


4.01108 1.27035 


1.000000 


(20,40) 


8.66044 


25.70777 


1.19912 5.11383 


1.000000 




Table 2. = A ^2 


= 40, Ml = M 2 = 25. 




UJ 


k{S) 








(5,5) 


17.25141 


1.44270 


1.27717 1.27788 


1.18549 


(10,5) 


14.50527 


1.90157 


1.64653 1.21589 


1.14762 


(5,10) 


15.79645 


1.91053 


1.21513 1.65534 


1.14760 



Table 3. Ni = N 2 = 40, Mi = 25, M 2 = 20. 
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CO 


k{S) 






1 




(30,30) 


6.79150 


13.80025 


1.77435 


2.01674 


1.03194 


(40,20) 


5.54002 


14.20308 


3.72434 


1.26206 


1.02831 


(20,40) 


8.07529 


17.10853 


1.19631 


4.72469 


1.02811 




Table 4 


. Ni=N2 


= 40, Ml = 


25, M2 = 20. 




CO 


k{S) K{Cf'^S^) 


«(C2-"S") 






(5,5) 


9.73994 


1.18636 


1.16951 


1.16972 


1.16500 


(10,5) 


9.10944 


1.25928 


1.21972 


1.14636 


1.13931 


(5,10) 


9.37308 


1.26280 


1.14621 


1.22346 


1.13936 




Table 5 


. Ni=N2 


= 40, Ml = 


30, M2 = 10. 




CO 


k{S) 






1 


k(CF^S") 


(30,30) 


5.60318 


4.47593 


1.55878 


1.73865 


1.03195 


(40,20) 


4.90427 


4.56155 


2.56449 


1.21294 


1.02832 


(20,40) 


6.08107 


5.40245 


1.15975 


3.16567 


1.02812 




Table 6 


. Ni=N2 


= 40, Ml = 


30, M 2 = 10. 
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Performance of MOL for Surface Motion 
Driven by a Laplacian of Curvature 

Wen Zhang Ian Gladwell 



Abstract 

We analyze the performance of the method of lines when solving 
a partial differential equation system describing microstructural evolu- 
tion in a sintering process. The system involves a fourth order nonlinear 
partial differential equation with a moving boundary. Both sequential 
and parallel ordinary differential equation solvers are applied. 

KEYWORDS: method of lines, parallel ODE solver, partial differential equa- 
tions, sintering, surface diffusion, motion by curvature, moving boundary 

1 Introduction 

We discuss and analyze the performance of the method of lines (MOL) when 
solving a system of partial differential equations (PDEs) describing micro- 
structural evolution in a sintering process. Sintering is a material manu- 
facture process where powdered material is densified to form a solid body. 
From a microstructural perspective, the sintering process can be viewed as the 
combination of two diffusion processes: surface and grain boundary diffusion. 
The mathematical model for surface diffusion is a fourth order nonlinear PDE 
and for grain boundary diffusion is a second order linear PDE coupled with 
an integral equation. The two models join at the moving boundary and con- 
stitute a closed mathematical system governing the movement of the particle 
surfaces during sintering; detailed descriptions can be found in [nmiiii- 
Here, we focus our attention on the surface diffusion model. The charac- 
teristic feature of surface diffusion is that the mass flow is restricted along the 
surface. Following the theory of Herring [3 ^3 El) mass flow is generated 
by the surface gradient of the chemical potential, which is proportional to 
the surface curvature. Hence, locally the motion is driven by a surface La- 
placian of the curvature. This motion is nonlinear and complex. There are 
limited analytical studies 0 E] • Particularly, Elliott and Garche uni prove 

Z. Chen, R.E. Ewing, and Z.-C. Shi (Eds.): Proceedings 1999, LNP 552, pp. 419-^^^ 2000. 

(c) Springer-Verlag Berlin Heidelberg 2000 
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existence of a solution under certain conditions. We have used MOL suc- 
cessfully to solve the system ^01 fTTl ITH] employing the ordinary differential 
equation (ODE) solvers LSODE ^1^ and DASSL Here, we explore 
the use of a new generation of ODE solvers, both sequential and parallel, and 
compare them for convergence of the solution of the PDE. 

We use four ODE solvers, the sequential solvers VODE ^ and VODPK 
IQ Qi and the parallel solvers PVODE |S| and the version of PVODE in 
PETSc m El- VODE was developed from LSODE. VODPK is identical to 
VODE except for the linear solver. VODE uses a direct method as in DASSL 
and LSODE, while VODPK uses an iterative method; i.e., a scaled, pre- 
conditioned, incomplete generalized minimum residual method (SPIGMR). 
PVODE is a parallel solver based on VODPK and its modularized C version 
CVODE |H|. It is built upon the Message Passing Interface (MPI) library 
and is intended for a single program multiple data environment on distri- 
buted memory computers. PETSc is a parallel computation library that 
includes linear and nonlinear equation solvers, unconstrained minimization 
modules and ODE solvers. It is also built upon MPI and includes PVODE 
(with some modifications) as one of its ODE solvers. For simplicity, we shall 
use the notation PETSc to represent the PETSc/PVODE solver. 

An advantage of using MOL to solve PDFs is the stability of the numerical 
solution achieved from employing the dynamically chosen time steps and 
formulas of the integration. Here, stability has two meanings. One is stability 
of the solution as time proceeds. The other is stability of the solution as the 
spatial step size decreases. The latter is a virtue of the MOL approach. With 
a reliable ODE solver, MOL can guarantee error control on the solution for 
any sufficiently small integration error tolerance and all small spatial step 
sizes. That is, once we decide on a spatial discretization scheme and choose 
a sufficiently small integration error tolerance, the error in the solution will 
not grow as we refine the spatial mesh. 

2 The Mathematical Model 

To evaluate the performance of MOL, we consider 2D sintering; 3D cases are 
discussed in fZjE|. Since we are concerned with only the surface profile, 
this yields a computational ID problem. Consider a periodic 2D particle 
string shown in Fig. 1 with the x-axis as a line of symmetry and vertical 
grain boundaries. 
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Figure 1. Periodic 2D particle string. 



The surface is represented by function y(t, x) with t being the time va- 
riable and X the spatial variable. For this system, the dimensionless surface 
diffusion equation is 

dy dJ 

dt dx ’ 



with surface flux 



J= - 



, dy 
1+1 / 
ax 



- 1/2 



dK 

dx 



and surface curvature 



K =- 



dx"^ 



1-h 




Exploiting symmetry, we need compute only that part of the surface pro- 
file between a grain boundary and the central vertical line of a particle. By 
placing the origin at the center of a particle, we have symmetric boundary 
conditions (BCs) at a; = 0: 



Vx fll Vxxx fl 



At the other boundary, x = xcsit), we impose the BCs of a fixed dihedral 
angle, grain boundary flux and material plating rate: 

y^ = -cot(-), 



where A is the dihedral angle. Job is the grain boundary flux at x = xoB(t) 
and r is the ratio of grain boundary to surface diffusion. All variables are 
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dimensionless. Their relation to the physical units is shown in m - When 
Jgb ^ 0, we have a moving boundary problem. 



3 The Computational Model 



We select a mass conserving finite volume scheme for spatial discretization 
of Eqn. (1). In this scheme we use second order central differences for all 
derivatives except near the triple point, x = xqb-, where we use one-sided 
second order differences. It can be shown that the spatial discretization error 
is of second order for solutions in C® . 

Since the moving boundary is unidirectional in our model, we select a 
front tracking method which maps our coordinates to the moving boundary. 
Hence our computational grids are fixed but our physical grids move with 
the moving boundary. In ID this simply implies scaling. Throughout, we use 
uniform meshes in space with iV equally spaced mesh points and a step size 
h = 1/{N — 1). We let j/„ = y{t, s„) where s„ is a mesh point of the scaled 
spatial variable s = xjxGB- After discretization, the PDE system becomes 
the ODE system 



1+1/2 — Jn- 1/2 , Vn+l ~ Un-l , 
y^t — rzr. ' or™ SuXqb, 



Jr 



+ 1/2 



hxQB 



1-k 



2hXGB 

f ?/n+l Vn—l \ 

\ 2hxGB ) 



= - 



^ I f Vrt+2 Ur 

\ 2hxGB 

Vn+l ‘2yn -\- yn—i 



- 1 / 2 ' 



- 1/2 



J^n+1 



hXGB 






GB 



1-k 



2/n+l Vn—l 



2hxGB 



-3/2 



iT f A \ 

GBit) = l^sin - - KNyN)j , 



3Jiv — 4 Jjv- 1/2 + (>^Ar-l/2 + >^Af-3/2)/2 , 

VNt = r ^VxXgb, 



•^Af-1/2 — - X 



1-t- 



hxGB 

f yn+1 Vn—l 

\ 2hxGB 



- 1/2 



+ + 



ATn-t-1 

hxGB 



for n = 1, 2, . . . , TV — 1 with yo = ?/ 2 , y~i = ?/ 3 - The Jacobian of this ODE 
system typically has the structure shown in Fig. 2. It has band width 5 
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X X X X X 

XXX 

Figure 2. The two band structure of the Jacobian. 

everywhere except near the right end where it has three full columns. Also, 
the lower band width increased by 1 due to using one-sided differences. This 
structure often occurs in moving boundary problems and can be exploited 
computationally by a combination of algorithms for banded and full matrices. 

4 Computational Performance and Analysis 

To be thorough, we have selected a wide range of integration error tolerances, 
tol, and numbers of spatial mesh points, N. The range of tolerances used is 
tol = 10“^ to tol = 10”^^ and N ranges from 33 to 1025. As an ‘analytical’ 
solution, we use the numerically proved ‘exact’ solution obtained with the 
finest spatial mesh, N = 1025, and the most stringent tolerance, tol = 10“^^. 
We fix the physical parameters in the sintering model at physically realistic 
values: the dihedral angle A = 160° and the ratio of grain boundary to surface 
diffusion T = 0.1. To better illustrate the error behavior, we use an initial 
sinusoidal surface with maximum height 1 and initial grain boundary of height 
0.156. Our computation reaches the equilibrium state stably. Fig. 3 shows 
the initial and final surface profile. To compare errors, we stop the integration 
at a fixed time, t = 1, before equilibrium is reached. Then, the height of the 
entire surface lies between 0.85 and 1 so the absolute and relative errors in 
the solution are almost equivalent. For simplicity, we measure absolute error 
and use the scaled variable s instead of x. 
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We use the backward differentiation formula (BDF) option in the ODE 
solvers. Fig. 4 shows the errors as the spatial step size decreased by halving. 
The errors behave almost uniformly on the surface and converge to zero as 
the spatial step size approaches zero. The calculated orders of convergence at 
the midpoint in s in Fig. 4 are shown in Table 1. We observe the anticipated 
second order convergence for a sufficiently large number of mesh points. In 
VODE, we implement the full exact Jacobian matrix. In VODPK, we use the 
full dimension of Krylov subspace and no preconditioning. In PVODE, we use 
the built-in block diagonal with banded block preconditioner (PVBBDPRE) . 
In PETSc, we use the full dimension of Krylov subspace and 5 x 10“® for the 
ratio of tolerances of the linear to the nonlinear solver. The four ODE solvers, 
VODE, VODPK, PVODE, PETSc produce essentially the same results. For 
a less stringent integration error tolerance, tol = 10“"^, we no longer observe 
second order convergence as the number of spatial mesh points increases, see 
Figs. 5(a)-(c). This is due to the integration error dominating the spatial 
discretization error. Note, from Figs. 5, the errors are bounded as the number 
of mesh points increases, demonstrating that stability is preserved by the 
MOL approach, due to the high quality of the ODE solvers. 



Initial and Final Equilibrium Surface and Grain Boundary Profiles 



— -- 




Initial surface — 
Equilibrium surface — 
Grain boundary 






Grain boundary ' 












\ 



0 0.5 1 1.5 2 2.5 3 3.5 

X 

Figure 3: Profiles. 



Results using VODE/PVODE/PETSc with Tolerance=1d-10 




Figure 4: Errors. 



We also observed that error growth can be induced by poor approxima- 
tions to the Jacobian, and by errors from the linear and nonlinear solvers. 
The iterative linear system solvers in PVODE and PETSc can perform as 
accurately as the direct linear solver in VODE with an appropriate tolerance 
in the linear solver, Krylov subspace dimension and preconditioner. Without 
properly adjusted parameters, the iterative solver can produce poor results 
and MOL can loose stability as the spatial mesh size decreases. Fig. 6 shows 
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results when using the default parameters in the linear solver in PETSc; i.e., 
the linear to nonlinear iteration convergence factor is 0.05 and the Krylov 
subspace dimension is 5. The errors grow significantly as N increases and 
eventually stability is lost. Fig. 7 shows corresponding results when we use 
the banded Jacobian approximation with band width 5 in VODE; hence cut- 
ting out the moving boundary information in the Jacobian. Again, observe 
the growth of the errors and the instability induced as the number of mesh 
points increases. In these cases, MOL behaves like a finite difference method 
where the constraint for stability is a simple relation between the time and 
spatial step sizes. The instability can be suppressed by decreasing tol. Ho- 
wever, our tests show that this is less efficient than using properly adjusted 
parameters and an appropriate Jacobian approximation. 



Results using VODE with Tolerance=1d-4 




Results using PVODE with Tolerance=1d-4 




Figure 5a: Errors with VODE. Eigure 5b: Errors with PVODE. 



Results using PETSc with Toierance=1d-4 





Figure 5c : Errors with PETSc. 



Figure 6: Errors with PETSc-default. 
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Mesh 


33 


65 


129 


Order 


1.53 


1.82 


1.98 



Table 1: Convergence rate with spatial mesh size 



Band Width 


5 


7 


9 


11 


13 


65 


full 


Computing Time 


1302 


9.261 


9.578 


12.80 


12.94 


22.18 


29.16 



Table 2: Computing time in seconds for varying band width; 257 spatial mesh 

points; integration error tolerance 10”^^. 



Although the Jacobian is not banded, it can be well-approximated by 
a banded matrix with an appropriate band width. We use a Jacobian ap- 
proximation with band width 7 instead of the band width 5 which actually 
applies in most parts of the Jacobian. The numerical results from VODE, 
see Fig. 8, show that the errors and instability are completely under the 
control. The results here are as good as in Fig. 5a where the exact full Ja- 
cobian is used. Using this banded approximation significantly reduces both 
the computing time and the coding effort for the Jacobian. (In our tests, we 
modified VODE slightly in the way it forms the approximated Jacobian to 
achieve better results.) Further increases in the band width gave no signi- 
ficant improvement, whereas reducing the band width led to growth in the 
error and to instability. We also tested using different band widths in the 
preconditioner PVBBDPRE employed in PVODE, see Table 2. The results 
are similar to those when using VODE. Table 2 presents results for equal 
upper and lower half band width. 

The excessive computing time for band width 5 indicates that the ap- 
proximation for the Jacobian is too poor, so instability occurs, and time is 
wasted integrating through the instability. For band widths greater than 5, 
the computing time dramatically decreases and there is no instability. The 
most efficient band widths are 7 and 9, just above the critical band width 
5. The critical band width case can be analyzed by studying the eigenvalues 
of the Jacobian. A good approximation to the Jacobian should have eigen- 
values close to those of the exact Jacobian. Since our problem is nonlinear 
we computed the eigenvalues of the Jacobian and its banded approximations 
at a fixed time in the integration. Table 3 shows the largest 4 eigenvalues 
computed at the same time as when we compare the errors in the solution. 
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Band Width 


1 


3 


5 


7 


9 


exact 


Eigenvalue 1 


1.8 


1.7-b6 


-5.7-2 


1.1 


1.0 


-0.21-bi0.19 


Eigenvalue 2 


-4.9-b6 


1.6-b6 


-2.6-bl 


-1.7-1 


-2.0-1 


-0. 21-10.19 


Eigenvalue 3 


-4.9-b6 


1.5-b6 


-7.8-b2 


-8.0-bl 


-8.1-bl 


-8.3-bl 


Eigenvalue 4 


-5.0-b6 


1.3-b6 


-4.8-b3 


-1.2-b3 


-1.2-b3 


-1.2-b3 



Table 3: Largest 4 eigenvalues for the approximation to the Jacobian, band widths 
1, 3, 5, 7, 9 and the exact Jacobian. 



When the band width is less than 7, these eigenvalues are far from those of 
the exact Jacobian. They change significantly with increasing band width. 
For band width 7, there is a significant improvement in the accuracy of the 
eigenvalues. The largest two eigenvalues correspond to a split of the only pair 
of complex eigenvalues of the exact Jacobian. The remaining eigenvalues are 
within three correct digits of those of the exact Jacobian. As the band width 
increases further, the eigenvalues do not change significantly. Hence, by choo- 
sing the smallest band width above the critical value we are able to achieve 
most efficiency when using the ODE solvers. 



Results using VODE with Banded J Approximation of Band Width 5 

mesh 33 — 
mesh 65 — 

mesh 129 

mesh 257 

mesh 513 — 



0.4 0.6 



Results using VODE with Tolerance=1d-4 



mesh 33 - 
mesh 65 • 
mesh 129 - 
mesh 257 
mesh 513 - 



Figure 7: Errors with VODE-band 5. 



Figure 8: Errors with VODE-band 7. 



5 Conclusions 

We have shown that when solving surface diffusion problems with a moving 
boundary using MOL and quality ODE solvers, for stability and accuracy it 
is essential to use sufficiently stringent integration tolerances and sufficiently 
accurate approximations to the Jacobian. The Jacobian resulting from MOL 
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typically has a structure of a diagonal band and a vertical band for our 
problem. It can be well approximated by a banded matrix with a sufficiently 
wide band width. Also, the block diagonal preconditioner performs well with 
the iterative linear solver. 
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A High-Order Upwind Method 
for Convection-Diffusion Equations 
with the Newmann Boundary Condition 

Weidong Zhao 



Abstract 

In this paper a high-order upwind hnite difference method is stu- 
died for steady convection-diffusion problems with the Newmann bo- 
undary condition. Based on these equations in the conservation form, a 
conservative high-order upwind finite difference scheme on nonuniform 
rectangular partition is proposed. The scheme satisfies the maximum 
value principle and has a second-order error estimate in the discrete 
norm. The method and its analysis apply to groundwater pollution 
and reservoir simulation problems. 



KEYWORDS: convection-diffusion, high-order upwind method, error esti- 
mate 



1 Introduction 

Consider the 2D, steady, linear convection-diffusion problem 

—V • (AVm) -I- V • {bu) + cu = f, X € n, 

{AVu) ■ it = 0 X GT = 50 , 

where u = u{x, y) is the transport quantity, A = diag(ai(a;, y), a 2 (x, y)) is the 
diffusion coefficient matrix, b= {bi{x , y) , b 2 {x , y))"^ is the transport velocity, 
c = c{x,y) and / = f{x,y) are known functions, O is a bounded region in 
B? , 50 is the boundary of O, and it is the normal exterior unit vector to 50. 

Many realistic fluid flow procedures, such as groundwater pollution and 
reservoir simulation problems, can be described in the form of convection- 
diffusion equations [1, 2] and these equations are often convection-dominated. 
It is very difficult to simulate this kind of problems numerically. The ordi- 
nary finite difference and finite element methods, which have been widely 
used in numerical simulation of the nonconvection-dominated problems, in- 
troduce nonphysical oscillations. To overcome nonphysical oscillations, over 
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the past three decades, the upwind schemes have been extensively studied 
and widely used in the numerical simulation of this kind of problems (e.g., 
see [3-10]). The standard upwind methods avoid the numerical oscillation, 
but they only have first-order accuracy. To improve the accuracy of up- 
wind methods, many upwind schemes have been proposed by modifying the 
second-order differential terms to obtain the second-order accuracy. But some 
of them do not preserve the maximum value principle and others do not pre- 
serve the conservation property. To establish a high-order method which is 
in the conservation form and preserves the maximum value principle, for ID 
problems it has been extensively studied, but many such methods are not 
as efficient for high dimensional problems as for ID problems. Thus it is 
very important and interesting to consider the multi-dimensional high-order 
upwind methods, which are in the conservation form, satisfy the maximum 
value principle, and have a high efficiency of applications. 

In this paper we study a high-order upwind method for (1.1). In [7], a 
modified high-order upwind method on uniform rectangular grids has been 
proposed and analyzed and its theoretical results can only be obtained for 
uniform partitions. With a new technique of modifying the convective and 
diffusion terms and the domain partition, a new kind modified upwind fi- 
nite difference method on nonuniform rectangular partitions is proposed and 
analyzed in this paper. The proposed scheme is in the conservation form, 
satisfies the maximum value principle, and has a second-order error estimate 
in the discrete norm. 

The paper is organized as follows. The modified high-order upwind fi- 
nite difference scheme for (1.1) is given in §2 and its theoretical analysis is 
expanded in §3. Some conclusions are given in §4. 

2 High-Order Upwind Scheme 

In this section, we consider a high-order upwind finite difference method for 
multi-dimensional problems. For simplicity, only 2D problem is considered. 
The results obtained are also true for multidimensional problems. 

2.1 Discretization and Notation 

Let 0 = xo < xi < ... < xn < xn +1 = 1 and 0 = yo < Vi < .■• < 
Vm < Vm +1 = 1 be a partition of = [0, 1] x [0, 1] in x and y directions, 
respectively. The partition is denoted as In with grid points 0 < i < N, 
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and 0 < j < M, where (i,j) represents {xi,yj). For a continuous function 
f{x,y), fij means f{xi,yj). Let Xi+ 1/2 = Xi + Xi+ij2, yj+ 1/2 = Vj + J/i+i/2 
for t = 1, 2, . . . , TV - 1, j = 1, 2, . . . , M - 1, with xi /2 = 2 / 1/2 = 0, xn+i /2 = 
Vm+ 1/2 = ^, hf = Xi+i /2 - Xi_i /2 for z = 1, 2, . . . , fV, = yj+ 1/2 ~ yj- 1/2 
for j = 1, 2, . . . , M, for z = 1, 2, . . . , fV - 1, /iJ+ 1/2 = 

2 / 1+1 — 2/1 for J = 1,2,...,M — 1, = max /if, = max UH, and 

I*j = [a:i_i/ 2 ,a:i+i/ 2 ] x [ 2 //- 1 / 2 , 2 //+ 1 / 2 ]- Let denote the dual partition of 
Q. with Ih- The nodes of are {xi+ 1 / 2 , yj+ 1 / 2 ) for z = 0,1,..., /V, j = 
0,1,. ..,M. 

Now, define Vh as a piecewise constant function space on i.e., Vh = 
{z;, v\i/. = Vij,i = 0,1, ... ,N,j = 0, 1, ... , M}. Define the discrete and 

norms for Vh as 



i+ij Ujj — i) 



h 



i-1/2 



)tfi, 



i—lj — 1 x-l/2 

N M 

Hlh = E E [bii-i/2,j{utj - u,-ij)^h^j + b2^,j-l/2{uij - u^j-i)^hf], 

Z=1 j — 1 

N M N M 






= E E hll?. = E E 



i=ij=i 



i=ij=i 



2.2 A High-Order Upwind Scheme 



To construct a high-order upwind scheme for (1.1), we first outline the method 
for a ID convection-diffusion equation. The ID convection-diffusion equation 
is 



d , du. d ^ 

— — (a — ) -I- — (ozz) -I- czz = /, xGii, 

dx dx dx 

du. du. 

— = 0 , 

dx dx 



( 2 . 1 ) 



where u = u{x) is the transport quantity, a = a(x) > 0 is the diffusion 
coefficient, b = b{x) is the transport velocity satisfying 6(0) = 6(1) = 0, and 
c = c{x) and / = f{x) are known functions. 

Let 0 = xo < xi < . . . < xn < xn +1 = 1 be the partition of fl = [0, 1]. 
The partition is denoted as Ih. Let Xi+ 1/2 = Xi + xi+i/2 for z = 1,2, . . . ,N — 
1, with xi /2 = 0, xm+ 1/2 = 1, fo = a;i+i /2 - Xi_i /2 for z = 1, 2, . . . , iV, and 
hi+i /2 = Xi+i - Xi for z = 1, 2, . . . , /V - 1. 

Integrate (2.1) over [xi_i/ 2 , Xi+ 1 / 2 ] and use Green’s formula to get 



■[ai-|-l/2M(_|_l/2 ~ ®i-l/2'M(_l/2] + ^i+\ /2'Ui+l /2 ~ l/2Mi_ 1/2] 
-I- Jj. cudxdy = fj, fdxdy. 



( 2 . 2 ) 
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for i = 1,2, . . . , N. In (2.2), the central difference method may be used to 
approximate the term Wi+i /2 in 6 i+i/ 2 Uj+i /2 must be handled 

carefully in order that a nonphysical numerical oscillation does not occur. 



To do this, a special technique should be used. Here we approximate the 

f 1 X > 0 

convective term 6 i+i/ 2 Ui+i /2 as follows. Define H{x) = i C ~ ^ ■ From 



the equality. 



0, X < 0 



bi+l/2Ui+l/2 — h+1 /2[H /2 + (1 — ^^(&i+l/ 2 ))Wi+i/ 2 ], (2.3) 



we know that H( 6 j_|_i/ 2 )wi+i /2 and (1 — i?(&i+i/ 2 ))ui+i /2 are related to the 
positive and negative transport velocities, respectively. Thus H( 6 j+i/ 2 )wi+i /2 
and (1 — i?(&i+i/ 2 ))wi+i /2 must be approximated differently. From Taylor’s 
formula, has the two expressions 

Mi+l/2 = Ui-\ -^^u(+l/2 -^^w”i+l/2 + (2-4) 



^i+1/2 



— 



i+1/2. 









i+1/2 ■ 



0(h^ 



*+ 1/2 



)■ 



(2.5) 



From the nature of transport and the upwind idea, (2.4) and (2.5) should be 
used in i?( 6 i_|_i/ 2 )Mi+i /2 and (1 — i?(&i+i/ 2 ))wi+i/ 2 ) respectively. The term 
Ui-i /2 can be handled similarly. Then (2.2) can be written as 



Kn*+l/2 2 /^i+l/2 (,n*-l/2 2 )^i-l/2\ 

+bi+i/2[H{bi+i/2)Ui + (1 — i?(6i+i/2))Mi+l] — 1/2 1 / 2 ) 11 *- 1 



+ (l-i^(&*-l/2))n*]-['^±i^^^ 



*+1/2 



-U" A 



I/ 2 J 



+ fj, cudx + 4” ^i+ 1 / 2 ) “ //• 

( 2 . 6 ) 

Based on (2.6), a modified high-order upwind finite difference scheme for 
( 2 . 1 ) can be proposed as 



-Vj(a*V2[/), + Vi(6[/“")i + c,[/, = fi. 



(2.7) 
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for i = 1 , . . . , N, where 



V 7 t *V 7 TT\ Ui—l 

Vi(a VxUji — ^ ^ ®i-i /2 ^ )) 



2 r /2 

^“*+ 1/2 



‘■i+ 1/2 



i-\j 1 



'■^^ 1 '^ 2ai+i/2 + |^i+l/2|^z+l/2 

Vs(6C/“-)i = ^(6.+ i/2C/“+"i/ 2 - 6*-i/2C/ri/2), 
C^r+ 1/2 = H{h+if 2 )U^ + (1 - H{b,+ 1 / 2 ))U,+ 1 , 
^i= ft h- fi = i: Jj. fdx, 



with Oi/2 = OaT+1/2 = ^1/2 = ^AT+1/2 = 0 - 

Using the idea of getting the scheme ( 2 . 7 ), we construct the high-order 
upwind scheme for ( 1 . 1 ) as follows: 



^ xU^ij ^y(^2^y^)u 

+Vx{biU^n + Vy{b 2 U^yh + c,jUij = fij, 



(2.8) 



for i = l,...,N,j = 1 ,...,M, with ai/2j = apf+i/2,j = Oip/2 = ai^M+i/2 = 
0 i bi/2j = ^iv+i/2,j = h,i/2 = bi^M+1/2 = 0 , where 






- aij- 1 / 2 .i ) 

‘'i+l /2 "'i- 1/2 

Vy(02VyC/)ij = Jlj{0-2i^j + 1/2 - — a 

2 a^ 



hV '^Hj-l/2 ,y 

"'j + 1/2 



Uij Ui + — \ 



'i- 1/2 



a 



i*+i/ 2 ,i 



i*-i-i/ 2 ,i 2 aii+if 2 ,j + \bii+i/ 2 ,j\h'. 



2o2 
^“ 2 i, i + l /2 



i+l /2 



2 i, i + l /2 2 a 2 jj + i /2 + |& 2 i,i+l/ 2 l^J+l /2 ’ 

V,(&1 [/-),, = ^(6h+1/2.,C/“+"i/2o - ^H-l/2+C^ri/2.,)> 
t^r+ 1 / 2 ., = H{bull/ 2 ,,)U,J + (1 - H{bu+l/ 2 j))Ui+l,j, 

Vy{b2U-y)., = ^{b2., + ,/2UZ+,l2 - b2^,-l/2K]_,/,), 

^i]+l /2 = H{b 2 i,j + l/ 2 )Uij + (1 - i 7 (& 2 *,i+l/ 2 ))t^i,i+l- 



3 Theoretical Analysis 

In this section, we prove that scheme ( 2 . 8 ) satisfies the maximum value prin- 
ciple and obtain its error estimate. 
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3.1 The Maximum Value Principle 

It is well known that (1.1) satisfies the maximum value principle if 

div{b) -\- c > 0, {x,y)£fl. (3-1) 

Thus, for (2.8), we assume that 

hi-\-l/2,j - bli-l/2,j , ^2i.i+l/2 - b2t,j-lj2 , ^ ^ 

f^y > u (a.z) 

* 3 

holds for i = 1,2 ... ,N and j = 1, 2 . . . , M. 

Theorem 3.1 Scheme (2.8) is conservative and under condition (3.2), it 
satisfies the maximum value principle. 



Proof: From the definition of (2.8), it is easy to verify that it is conservative. 
To prove the maximum value principle, we write (2.8) in the equivalent form 

T j -\- CijUij -\- DijUi j—i -\- EijUijj,-i = Fij 

i=l,...,N, j = l,...,M, 



where 

Aij = [— 
Bii = \— 



Dij 



Ei4 — — 



F,., 



a 



5-1/2 
^li + l/2,j 
5+1/2 
2i,j-l/2 

ft.® 

5-1/2 
'2i, 2 + 1/2 

Td> 



C,/ = 



2 + 1/2 
_ r“li + l/2,2 



~ bii-i/2,jH{bii-i/2j)]hlj , 

+ bu+l/2,j {l-H{b,^+,/2,j))]hl 

~ b2i,j-l/2H{b2iJ-l/2)]hi , 

+ ^ 22 , 2 + 1 / 2(1 ~ H{b2ij+i/2))]hf , 



'^i+ 1/2 ® 



“2», 2 + 1/2 
ft® 



®2i, 2-1/2 

h 



n 



i + 1/2 f+l/2 j + 1/2 3-1/2 

+ [^12+1/2, /ll(^li+l/2, 2 ) ~ ^ 12 - 1 / 2 , 2(1 ~ 11 (^ 12 - 1 / 2 , 2 ))]^/ 

+ [^22,2 + 1/211(^22,2 + 1/2) ~ ^22,2-1/2(1 ~ H(^22,2-l/2))]^f + Cijhfh ^, 
= hjhfhl 



Further, 



Aij + Bij + Cij + Dij + Eij 

[bii+i/2,j ~ bii-i/2,j]h^ + [^ 22 , 2 + 1/2 ~ ^ 22 , 2 - 1 / 2 ] + Cijhfhj. 

From (3.3) and the definition of H{x), it is easy to verify that Aij, Bij, 
Dij and Eij are nonpositive and Cij is nonnegative. Under condition (3.2), 
Aij + Bij + Cij + Dij + Eij is positive too. Thus it satisfies the maximum 
value principle. [] 
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3.2 Error Estimate 

As we know, (1.1) is regular and uniquely solvable in if 0.5div6 + c > 0. 
Thus, for (2.8), we also assume that 



c 



* 



l^^n+l/2,j - &H-1/2J ^ , ^2i.i+l/2 - &2*j-l/2^ ^ ^ 

2^ hf ^ + ^ )>co>0 

^ J 



(3.4) 



holds for i = 1, 2, . . . , A^ and j = 1, 2, . . . , M. For the sake of error estimates, 
we assume that 



> QlO) ^2 > ^20 7 



(3.5) 



for two positive constants aio and 020 . 

Let Uij be the solution of (1.1) at the point (xi,yj). Then Uij satisfies the 
equality 



+ Vyibu^y),, + c,,u,,)]h-hy = £ Ab, 



where 



■ 1 _ r ^“1 ^^1 

Ay - -[aH+i/2j^b+i/2j - aii_i/2j^b-i/2jJhj, 

7,2 _ r ^“1 ^^1 

Ay - -[a2*y+i/2^ki+i/2 - a2i.j-i/2^ki-i/2jhi , 



— [(aij+i/2,i ' 
~(aii-l/2,j ■ 
= [(02*j + l/2 ' 
~(02ij-l/2 ■ 



\bli+l/2,j\h^j^l/2 * ^Ui+ij — U 

5 «H+i/2,jl ry 

^ 'b+1/2 



•1] 



kli-l/2j|b^_l/2 

2 



* \ 
~ ®li-l/2yV 



Uij 

bl/2 ^ 






l^2i,j + l/2 ^ \Ui j^i Uij 

2 ®2iy + l/2l 



\b2i,j-l/2\h'^j_l/2 



°'2t,j-l/2) 



k+1/2 

Ujj — Ujj-i , 



b-1/2 



-M, 
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■ [(oii+i/2,: 



\bii+i/2,j 



“A 



dx 









— [(02ij + l/2 - 



\b 2 i,j-i-i/ 2 \h^j+i /2 du 



)( 



dy 



U.i+i/2 



\b2i,j-i/2\h'^j_i/2 du 

-[a2i,j-l/2 ^ j(,^Uj-l/2 



\-l/2 
J + 1/2 

Mij — Uij-I 

ly h'H- 

^-1/2 



~ [^li+l/2,j(-ff (^li+l/2j)'*^ij + (1 ~ -f^(^li+l/2,i)'Wi+l ,j) 

— bli-l/2,j{H{bli-l/2,j)Ui-lJ + (1 — H{bii-i/2,j)Uij)]hj , 

4 = [^2ij + l/2(-ff (^2ij'+l/2)Wij + (1 — H{b2ij + i/2j)Uij+i) 

— b2i,j-l/2{H{b2i,j-l/2)UiJ-l + (1 — i7 (&2i j'-l/2 )Wij )] /if , 

5zi 

= -l/2[/>li+l/2j(l - 2i7(6ij+i/2j))/lf+i/2^li+l/2,i 



du , 



-/'H-1/2j( 1 - 2i7(6ii_i/2j))/lf_;^/2^li-l/2,j]^j> 
= -2[^2ij + l/2(l - 2i7(62i.i+l/2))/lJ+i/2^ki + l/2 

f)lJ 

~/^2ij-l/2(l ~ 2i7(62ij_i/2))/lJ_]^/2^li,i-l/2]^k 






Since 



H{bii^i/2j)uij + (1 — H{bii^if2,j))ui+ij — Ui+i/2,j 

1 OzL 

+ -(1 - 2i7(6ii+i/2j))/lf+i/2^li+l/2j 

(*r+l/2)^ I ',3^1 

g ^U+l/2,j + ^((/li+ 1 / 2 ) )> 

H{b2ij+l/2)uij + (1 — H (b2ij + l/2))uij + l = Uij + 1/2 

1 Ou 

+ -(1 - 2i7(62ij + l/2))/lJ+i/2^ki + l/2 

‘fi+ 1/2 — V3i-l/2 = + g(/li+i/2V3’k+l — ^i-l/27’”i-l/2) + 0{h^), 

(3.6) leads to 



+c^jU^j)]hfh^j = 



VyibU 






— (Vy(a2Vyu)ij + "S/ xibvA^'iij + 

. 9 , 9m, 9 , 9m, 9 ,, , 




438 



Zhao 



f) 6 8 

+ -^{b2u)ij + CijUij]hfhj + ^ ^ By- 

+ 0((/lf_i/2)^ + (^r+ 1 / 2 )^ + (^J+ 1 / 2 )^ + 



(3.7) 



where 

^ij = g[^li+l/2j(^i+l/2)^^li+l/2,i - ^li-l/2,j(^r-l/2)^^li-l/2j]^j> 

Ir, ^2^^■*^l . /lU ^ 9 .d^U, 



- ^l*J-l/2(^J_l/2)^^ki 



0 - ' y</- 

= gKk+ 1 / 2 ) — (^i_l/2) 

1 q2 ^2 



-l/2]/lf 



1 

^ij = 8k2*j + l/2(ft-J+i/2)^^ki+l/2 

, ^ ^2 
8 

B% = + l - (/lJ_i/2) 

1 B‘^11 

^ij = gkli+l/2j(^i+l/2)^^li+l/2,i 

1 B‘^11 k^i/ 

^ij = 8k2*j + l/2(^J+l/2)^^ki+l/2 - &l*J-l/2(^J_l/2)^^kj-l/2]^: 



4-1/2^ dy- 



;{b2u),j\hf, 



B‘^11 

- &li-l/2,j(^r-l/2)^^li-l/2,j]^j 

r\0 



Now, let = Uij — Uij. Then, use (2.8) and (3.7) to obtain that satisfies 



[-(Vi(a*VsC)*j - + Vi(6C““)ij + 

= E 4 + E Bb (3.8) 

+ 0((/iEi/ 2)^ + (^k- 1 / 2 )^ + (^> 1 / 2 )^ + (^j + l/2)^)*f^- 



Multiply (3.8) by sum it up for i = 1, 2, . . . , iV, j = 1, 2, . . . , M, and use 
the discrete Green formula to obtain 






N M 

L“li-l/2,jV i^x 

i=lj=l "'i-112 ""i-1/2 

\\bu-l/2M &\x^^~"'' fhU,2h) + ^k2..,-l/2lE^(^V^)^E"_l/2/^"] 

^ 'h-1/2 ^ ^j-lj2 

N M ; 

E S kii + 



-l/2^■ + 2 ' 

1 kli+l/2,j ~ ^li-l/2,i ^2i,i+l/2 ~ ^2i,j-l/2Ni^2 









'-n ' 9 ^ It 

i=lj = l ^ 

6 N M 8 N M 

= X) X) S ^ij^ij + X) X) S B\jVij 
i=3i=lj=3 /=3i=lj=3 

Af M 

+ E E o((/^Ei/ 2)^ + (/^Ei/ 2 )^ + (EEi/ 2)" + (EEi/2)")eb/^r^"- 

2=1 7 = 1 

(3.9) 

Thus, use the discrete Green formula and the definition of A\j, B\^, ^u- 1/2 j> 
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and ci 2 ^j_ii 2 to obtain 



N M 



l,b 



ICIa* + ;^\^\bh + X/ 



li+l/2,j ~ bii-i/2,j 



i=l j=l 









(3.10) 

where e is an arbutary positive number. From the above analysis, we have 



Theorem 3.2 Assume that u and U are the solutions o/(l.l) and (2.8), re- 
spectively, u has continuous 4th derivatives, and (3.4) and (3.5) are satisfied. 
Then, 

\^\a^ + \^u+mu<Q{{hn^ + {hyr), ( 3 . 11 ) 

holds, where Q is a positive constant which does not depend on the partition 

h- 



Proof: Choose e in (3.10) properly. Inequality (3.11) can be obtained from 
(3.4), (3.5), and (3.10) directly. [] 



4 Conclusions 

We have proposed a five point upwind finite difference scheme on nonuni- 
form rectangular partitions for convection-diffusion equations by using a new 
technique of handling the convective term. The proposed scheme is in the 
conservation form, satisfies the maximum value principle, and has a second- 
order error estimate. We have done some numerical experiments, which show 
that the high-order upwind scheme is as accurate as central methods for 
nonconvection-dominated problems and more accurate than full upwind me- 
thods for strong convection-dominated problems, in which the central me- 
thods do not work. Thus we can say that the proposed method can be used 
to simulate realistic fluid flow problems numerically. 
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